IS THE F SCALE IRREVERSIBLE?¹


RICHARD CHRISTIE 


Columbia University 


JOAN HAVEL 
Cornell Medical School 


AND BERNARD SEIDENBERG 


New York University 


One of the most frequently used instruments in
the area of personality and attitude measurement
is the F scale developed by the authors of The
Authoritarian Personality (1). It has not always
been used judiciously, but many data have been
collected in conjunction with it in hundreds of
samples. It has been found to be correlated
(significantly) with attitude measures varying
from Arab nationalism (24) to Xenophobia (6)
(not to neglect Xenophilia [22]) and with other
variables as diverse as Necker cube reversals (15)
and amount of out-group aggression in dreams (19).

The F scale was designed as a covert measure at
the personality level of incipient fascistic
tendencies, although it is more commonly
interpreted as a measure of authoritarianism.

A feature of the F scale that has come under
recent critical scrutiny is the unilateral wording
of its items. These are so phrased that agreement
with them is scored as indicative of potential
fascism, disagreement as non-fascistic. Most of
these criticisms stem from Cronbach’s summary of
earlier work on response set and his indication of
its importance in answering test or scale items
(10, 11). Some subjects, it appears, have general
tendencies to accept or reject items. Although
this susceptibility to respond in a set way may be
indicative of a broader factor of acquiescence, it
need not manifest itself uniformly. According to
Cronbach, response set is more likely to occur
when a given item is vague or otherwise lacking in
specificity or is in an area with which the
individual is unfamiliar. Such considerations are
pertinent to many of the items in the F scale
which are characterized by nonspecificity of
referent and do not make sense when analyzed
logically.

¹ This study would not have been possible without
the generous aid of those cited below. Donald T.
Campbell, N. L. Gage, and Douglas N. Jackson made
prepublication copies of their articles available
and kindly furnished further details of their
research. Tentative reversals of F-scale items
were critically scrutinized by Robert Holt, Marie
Jahoda, Robert Lee, and Gudman Smith and their
advice was in large part accepted. Angus Campbell,
Richard I. Evans, Warren Miller, Lester W.
Milbrath, A. Robert Rollin, and Peggy Cook played
a crucial role in recruiting subjects and their
aid cannot be fully acknowledged in a footnote.
Finally, the services of Stanley Budnitzky, Daniel
Clastner, Ethelreda Seidenberg, and Orlando
Sepulveda in scoring questionnaires and performing
computations are gratefully acknowledged.

RESPONSE SET AND THE F SCALE: SOME QUESTIONS


Although the authors of The Authoritarian 
Personality gave more detail about their 
research procedures than is customary, they 
failed to anticipate all the critical questions 
that might be raised and there are explanatory 
lacunae. One of these is the failure to explain 
why F-scale items were unilaterally worded. 
This is in contrast to an earlier discussion of 
the explicit rationale for a similar wording of 
the E (Ethnocentrism)-scale items since it 
was recognized that this violated recommenda- 
tions of experts on scale construction (1, p. 
59). 

Three of the 38 items in the original Form 78 
F scale were worded in such a fashion that 
rejection would imply fascistic tendencies 
(Nos. 12, 20, and 28: 1, Table 1 (VII), pp.
226-227). Two of them discriminated nega- 
tively (Nos. 12 and 20) and ranked 37th and 
38th in rank discriminatory power in the item 
analysis (1, Table 3 (VII), p. 245). These 
two items were believed to be poor because 
they were “unclear or ambiguous” (1, p. 247) 
and were dropped from further revisions of the 
F scale. The third reversal (No. 28) had poor 
discriminatory power and was not included 
in subsequent revisions of the F scale. It does 
emerge worded in the opposite direction, i.e., 
in conformity with the rest of the F-scale 
items, in Form 60 A of the F scale. 

The difficulties in constructing satisfactory
item reversals of the F scale that are hinted at
in this initial attempt are highlighted by four
recent studies in which F-scale items were
reversed. Correlations between the scales composed
of reversed items and the original F scale (or an
abridgment) indicate little success in
constructing adequate reversed items. If scales
composed of these reversed items measured
“authoritarianism” defined as the rejection of
democratically worded items it would be expected
that the correlation between them and the original
F scale, which was based upon agreement with
authoritarian or nondemocratic statements, would
approach +1.00 if subjects are ideologically
consistent and the reversals are adequate. This is
clearly not the case in these four studies. Bass
found a correlation of but +.20 (2), Chapman and
Campbell ones of +.17 and −.01 in two samples (7),
Jackson, Messick, and Solley one of −.35 (14), and
Leavitt, Hax, and Roche ones varying from −.66 to
[...] in five small samples (17).²

Four independent studies leading to findings of
negligible if not negative correlations between
the F scale and scales composed of reversals of
its items must be taken seriously. The lack of
high positive correlations was attributed to the
influence of response set by all four groups of
investigators. Bass even went so far as to assign
three-fourths of the “reliable” variance and 59
per cent of the total variance to response set
(2, p. 622). This conclusion has been challenged
by Messick and Jackson who pointed out that Bass
misidentified his factors when they were reflected
and that a factor analysis was not justified given
the nature of Bass’ data (20). Although Bass
acknowledged the validity of this criticism he
still argued that acquiescent response set is an
important component in results obtained with the
F scale (4), as did his two critics.

² The sign of the correlations in these studies
has been standardized in this discussion so that a
positive correlation indicates consistency of
content and a negative one a lack of relationship
between the F scale and a reversal scale scored so
that rejection of the items is scored as
authoritarian.

Such an evaluation is at least partially
consistent with other data. Cohn (6) was one of
the first to consider seriously the possibility
that the F scale was in part a measure of
acquiescent tendencies. He found a correlation of
+.41 between a “plus” scale of diverse items from
the MMPI and an unspecified version of the F
scale. Bass mentions finding only positive
correlations among 13 scales of 20 aphorisms each
although the content areas covered by the scales
varied (2, p. 617). Items from these scales appear
to be used in a scale of “social acquiescence”
which is reported to correlate +.49, +.48, and
+.16 respectively with the F scale in three
samples (3, p. 297). Chapman and Campbell report
obtaining a “rather pure” measure of response set
by scoring 74 questionnaire items that were
balanced in positively and negatively worded
direction for a general tendency to agree. This
measure was correlated with the F scale to the
extent of +.43 (7, p. 131). Leavitt, Hax, and
Roche cite an unpublished study by Mullery in
which correlations between the F scale and
acceptance of contradictory maxims of +.46, +.41,
and +.38 respectively were found in three small
samples (17, p. 215). A subsequent study by Carey,
Rogow, and Farrell (5) on the relationship between
agreement with positive and negative aphorisms and
the F scale indicated that in three of four
samples, agreement with both sets of aphorisms was
significantly correlated with F-scale scores
(positive aphorisms were defined as affirming
Christian, humanist, or democratic values;
negative aphorisms were those judged to deny these
ideals). Webster found that a measure of response
set correlated +.26 with the F scale (26, p. 31)
among Vassar students.

These findings are congruent with those of 
Newcomb (21) in the Bennington study. He 
had heard Thorndike speak of the “tendency 
to agree” and although he was not familiar 
with any studies analyzing what Cronbach 
was later to call response set he analyzed the
P.E.P. (political-economic progressivism) scale 
in terms of this “tendency to agree” and found 
that the split-half reliability on the P.E.P. 
scale among seniors in the class of 1939 was 
+.68 when scored by subtracting the number 
of disagreements from the number of agree- 
ments (21, p. 194). The correlation between 
tendency to agree as Freshmen and Seniors 
was +.33 (21, p. 195). These and other findings 
reported by Newcomb are of particular 
interest in the present context since the P.E.P. 
scale was a modification of Stagner’s pioneering 
scale of fascist attitudes (21, p. 20). 
Thorndike had actually conducted research on the
tendency to agree during the 1930's. This material
has not been published but an abstract of a paper
given by Lorge on the topic at a regional
psychological convention exists (18). Cronbach
also summarizes additional material on
reliabilities of scales in these studies (11).
Briefly, split-half reliabilities for given
response categories were high (.88 and .95), and
an unspecified positive correlation was found
between the number of “yes” responses on the
Bernreuter and the number of statements checked on
a Thurstone scale.

The preceding studies are similar in that the
measure of response set is based upon statements
that may vary in the extent to which they are
ambiguous to respondents. In other words, the
presence of response set is confounded to a
certain degree with the item content.

Gage, Leavitt, and Stone have used a different
technique to measure response set. They [...]
false information items in a test [...] a level of
difficulty that their subjects could not achieve
better than a chance score. The split-half
reliability of an [...] the number of “true”
responses, was +.68. (This is identical with that
found by Newcomb on the P.E.P. scale and suggests
hypotheses about the nature of the political and
economic attitudes of his Bennington sample.) A
similar scale constructed by Gage et al. had items
at a 75 per cent level of difficulty and the
reliability was but +.09, a result indicating that
“. . . difficult and ambiguous items are required
to elicit the acquiescent set. . . .” (12, p. 98,
italics added).


Gage et al. also administered, among other
measures, Cohn’s “plus” scale and the F scale. The
latter correlated +.44, a finding similar to
Cohn’s. The correlation between their
Information-True (items at chance level) test and
the F scale was +.35. This strikes us as the most
clearcut indication of the relationship between
the F scale and a response set measure that is
free from variance attributable to the attitudinal
content of the items. Roughly an eighth of the
variance in response to the F scale would then be
attributable to response set, at least among
studies conducted on the usual samples of American
college students (the lower correlation obtained
by Webster is believed to be attributable to the
fact that his sample was probably more
intellectually sophisticated than members of the
other samples reviewed).

Although the weight of evidence indicates that
responses to the F scale are affected by response
set, it does not support the notion that response
set is the primary determinant. The lack of
consistent positive correlations between the F
scale and reversed F scales in the four studies
cited therefore cannot be attributed primarily to
a tendency to acquiesce.
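
The "roughly an eighth" figure follows from squaring the +.35 correlation. A minimal sketch of that arithmetic, applied to the response-set correlations quoted above, is given here; the dictionary labels and the function name are ours, chosen only for illustration, and the sketch assumes the usual reading of a squared correlation as the proportion of shared variance.

```python
# Correlations between the F scale and response-set measures, as quoted in the text.
reported = {
    "Cohn, 'plus' scale": 0.41,
    "Chapman and Campbell, balanced items": 0.43,
    "Gage et al., 'plus' scale": 0.44,
    "Gage et al., Information-True": 0.35,
    "Webster": 0.26,
}


def shared_variance(r: float) -> float:
    """Proportion of variance two measures share, given their correlation r."""
    return r * r


for label, r in reported.items():
    print(f"{label}: r = {r:+.2f}, r^2 = {shared_variance(r):.3f}")
```

On this reading none of the listed correlations accounts for as much as a quarter of the variance, which is in line with the argument that response set is a component of F-scale scores rather than their primary determinant.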

There are other reasons for questioning an
attempt to equate responses to the F scale and
acquiescent response set (2, 4). The F scale
correlates highly with many measures which are
free from this artifact, e.g., the analysis of
incomplete sentences reported in The Authoritarian
Personality. (One of the oversights connected with
much of the justifiable methodological criticism
of that research is that the methodologically more
sophisticated work such as this has been largely
ignored.) Another qualification should be apparent
to anyone who has ever scored the F scale and
observed how remarkably inconsistent many people
can be in their responses; can a response set
underlie such variability? Finally, most of us who
have ventured outside the ivied halls have known
individuals who emphatically endorse statements
such as those embodied in the F scale. Yet they
are not only reluctant to acquiesce to statements
reflecting approval of any liberal political
activity of the past quarter century but
emphatically affirm that anyone making such
statements is a fool if not a traitor. Among those
who represent the quintessence of the F syndrome,
placid agreement to explicit liberal statements is
rare in our experience.

The preceding considerations indicate that
acquiescent response set is one component of
scores on the F scale but that its importance has
been over-emphasized in the four studies using
reversed items. A scrutiny of the data suggests
two possible reasons. One is that the reversed
items which were used have deficiencies as
meaningful psychological opposites to original
F-scale items. A second reason appears to lie in
the distributions of F-scale scores in the samples
studied; these are not congruent with the
supposition that the high scoring subjects are
acquiescent. These points are worthy of further
elaboration.

Problems in Reversing F-scale Items

The following considerations seem impor- 
tant to us in the construction of item reversals 
which should be accepted by those who reject 
F-scale items and rejected by those who ac- 
cept them: 

1. Logical opposition to the original items.
This point is by far the simplest and the studies
under consideration are not subject to criticism
on this basis. We do not, however, consider
logical opposition in itself a sufficient
condition for meaningful reversals. This point may
be illustrated by one of the few items which was
reversed in all four studies: “familiarity breeds
contempt.” The reversals of this laconic
aphorism were:

Familiarity does not breed contempt. [2] 

Familiarity breeds respect more often than it does 
contempt. [7] 

We are bound to admire and respect a person if we 
get to know him well. [13] 

The more you get to know people the more you get 
to like them. [17] 


These statements are all logically incom- 
patible with the original item. Yet they differ 
in wording and, we would argue, in their 
psychological incompatibility with the original 
statement and one another. 

2. Avoidance of extremity of wording. The 
statements in the F scale are frequently 
couched in all or none terms: “everybody,” 
“the most important,” “never do anything,” 
“no sane, normal, decent person,” “no weakness
or difficulty,” etc. Since this form itself
has been hypothesized as appealing to those 
high on the F scale and who are presumed to 
be intolerant of ambiguity it would appear 
appropriate to couch reversals in probabilistic 
form by using such qualifications as “some,” 
“many,” “one of the most important,” etc. 

In varying degrees, items used in the four 
studies failed to meet this criterion, e.g.: 


No people are born with an urge to jump from high
places. [2]

It is completely absurd to believe that some people
may be born with an urge to jump from high places. [7]

It is known with complete certainty that the urge to
jump from high places is learned, not inborn. [13]

All the mysteries surrounding our lives will sooner
or later be cleared up through the progress of
science. [17]³


³ Leavitt, Hax, and Roche did not reverse the “urge
to jump” item. The italics are ours.




The theory underlying the F scale suggests 
that a truly nonauthoritarian individual 
would be taken aback by the unqualified 
wording of such statements. Agreement with 
the underlying premise that suicidal tendencies 
are most plausibly explained by the vicissitudes 
of an individual’s life circumstances rather than 
heredity might be masked by a nonauthoritar- 
ian disinclination to agree with a dogmatic 
statement. 

3. Psychological opposition to original items.
Many of the reversals in these studies may be 
nondiscriminating because they are irrelevant 
to the psychological significance of the originals. 
In some instances we would predict that the 
reversals would be accepted by both high and 
low scorers on the F scale; in other cases that 
the reversals would not be acceptable to either 
group. Our judgments in these cases are largely 
subjective since they are based upon our inter- 
pretation of the theory underlying The 
Authoritarian Personality and familiarity with 
subsequent studies. Since it is difficult to 
formulate general rules in such a situation, we 
shall cite a few of the many statements that 
are relevant. 

Consider two reversals of the statement:
“Nowadays when so many kinds of people
move around and mix together so much a
person has to protect himself especially
carefully against catching infection or disease
from them”:


Nowadays, since democracy demands that people
of widely different backgrounds and station mix
together, a person should not be finicky about
catching disease from any of them. [13]

You need to get out and rub elbows with all kinds
of people to get new ideas and broaden your
understanding of life. [17]


The first reversal should presumably be 
accepted by low scorers since they, theoreti- 
cally at least, should not be over-anxious about 
“contamination” by others. High scorers might 
also be expected to accept the reversal since 
it is loaded in the direction of a conforming 
response; rejection of the statement implies 
rejection of democracy, and, despite their 
implicitly anti-democratic ideology, their need 
for conformity is usually associated with the 
acceptance of the symbols of the sanctioned 
democratic ethos. 

The second reversal, it is suggested, misses 
the psychological point of the original statement
which was intended to arouse anxiety
about catching diseases from unwashed out- 
groups. Instead, a folksy democratic statement 
was used. The basic concept is unlikely to be 
rejected by low scorers although they might 
wonder about the “need.” Rejection of the 
item by high scorers would imply that they 
were intraceptive, which runs counter to 
theory. For these reasons, the item should 
theoretically be accepted by both high and low 
scorers and thus be nondiscriminating. 

These considerations suggest that one of the
reasons why the scales composed of reversed
items fail to correlate with the F scale lies in the
nature of many of the items of which they are
composed. Many strike us as being subject to
rejection by low scorers because of their
extremity of wording, irrationality, or lack of
compatibility with the psychological
significance of items in the F scale. Other reversed
items are believed to be nondiscriminating 
because they might reasonably be expected to 
be accepted by both high and low scorers. 
There are enough items subject to these 
strictures in the studies under review to afford 
a plausible explanation for the findings 
reported without the necessity of invoking 
response set as the crucial variable. These 
considerations are believed to be especially 
crucial for the studies of Bass, who explicitly 
attempted to construct logical rather than 
psychological reversals, and of Jackson and 
Messick whose reversals were in the style of 
the original items. 

Acquiescent Response Set and Sampling

Three of the four studies under review used
college samples, apparently largely composed
of students at the beginning of their college
training. The majority of respondents in the
fourth (17) appear to have been highly educated.
This point is of methodological interest
because college samples do not tend to make
absolutely high scores on the F scale. More
precisely, if the range of item mean scores is
defined as ranging from 1.0 (strong rejection of
every item in the scale) to 7.0 (strong acceptance
of every item), it is found that most
college samples reported in the literature have
a mean score below the theoretical neutral
point of 4.0.

This distribution of scores is important in
evaluating the purported relevance of response
set to the F scale. Those in the lowest quarter
of the typical distribution usually have an item
mean score of well below 3.0 which indicates
that they must fairly consistently disagree
“slightly” or “somewhat” with the items with
an occasional agreement or “strong” disagreement.
Those who score in the highest quarter
typically have mean scores in the 4.0 or 5.0
range with a few scoring slightly higher.

The importance of such distributions for the 
equation of high scores on the F scale with 
acquiescence is due to the fact that “neutral”
responses are not customarily allowed. Any
scores between 3.0 and 5.0 therefore have to be
obtained by balancing off agreement and
disagreement with specific items. The typical
college student “high” authoritarian cannot, 
by virtue of the scale format and scoring, be 
consistently accepting items. This fact suggests 
that if these studies are based upon typical 
college samples, the lack of positive correla- 
tions reported cannot be attributable to ac- 
quiescence response set on the part of the 
high scorers in these samples since the latter 
are inconsistent in their responses. (It is 
hypothetically possible that some of the 
reported correlations are due to “negativism” 
on the part of low scorers who reject both the 
original F-scale items and the reversals!) 
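
The balancing argument can be made concrete with a short sketch. It assumes the conventional scoring described in the text (agreement scored 5, 6, or 7; disagreement scored 3, 2, or 1) and, as a simplification of ours, that no items were left unanswered. The function name and the 30-item example are illustrative only.

```python
import math


def agreement_bounds(item_mean: float, n_items: int) -> tuple[int, int]:
    """Fewest and most items a respondent can have agreed with, given an item mean.

    With k agreements out of n items, the total score lies between
    5k + 1(n - k) and 7k + 3(n - k). Solving both inequalities for k gives
    n(m - 3)/4 <= k <= n(m - 1)/4, where m is the item mean.
    """
    fewest = max(0, math.ceil(n_items * (item_mean - 3) / 4))
    most = min(n_items, math.floor(n_items * (item_mean - 1) / 4))
    return fewest, most


# A "high" scorer in a typical college sample, with an item mean of 4.5 on 30 items.
fewest, most = agreement_bounds(4.5, 30)
print(fewest, most)  # 12 26
```

Under these assumptions a respondent with an item mean of 4.5 must have disagreed with at least 4 of 30 items, and only at an item mean of 5.0 or above does agreement with every item become possible.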

It is impossible to delineate fully the possible 
effects of the range of responses on the F scale 
in the samples used in these four studies. Bass 
(2) reports an item mean of 2.83 which is 
equivalent to one of 3.83 in the conventional 
scoring of the F scale (a constant of three rather 
than four was added to raw scores). He reports 
an SD of 2.00 (2, Table 2, p. 620). This means,
if correctly reported, that his sample was 
necessarily uniquely bimodal on the F scale 
since it would be impossible to have a normal 
distribution on a scale with a six-point range 
and a standard deviation of a full two points. 
In view of this and other errors in these data, no 
reliable estimates can be made about the 
characteristics of the sample. 
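
A rough check on how unusual an SD of 2.00 would be is sketched below. It compares the reported figure with two reference values for scores confined to the 1.0 to 7.0 range: the SD of a flat (uniform) distribution, and the largest SD attainable at the stated mean, which by the Bhatia-Davis inequality occurs only when all scores sit at the two extremes. Treating the reported SD as a population SD of respondents' item means is our assumption.

```python
import math


def max_sd(mean: float, low: float, high: float) -> float:
    """Largest possible SD for scores bounded in [low, high] with the given mean."""
    return math.sqrt((mean - low) * (high - mean))


def uniform_sd(low: float, high: float) -> float:
    """SD of a flat distribution spread evenly over [low, high]."""
    return (high - low) / math.sqrt(12)


reported_mean, reported_sd = 3.83, 2.00  # Bass, in conventional scoring
print(round(uniform_sd(1, 7), 3))             # 1.732
print(round(max_sd(reported_mean, 1, 7), 3))  # 2.995
```

The reported SD is larger than that of a flat distribution over the whole range, so scores would have to be concentrated toward both ends of the scale rather than in the middle; it is not beyond the theoretical maximum.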

Chapman and Campbell do not report the 
mean or distribution of F scale responses for 
their two samples. They used a five-point 
response scale with an undecided category so 
that the preceding comments about the lack of 
a neutral category do not apply to their data. 
An item mean computed from unpublished
data for their second sample was 2.90, just
below the theoretical neutral point of 3.00.
Since these data were in the form of responses
to items, it was not possible to compute an SD
based upon Ss. It is therefore impossible to
determine the extent to which high scoring Ss
were consistent in their acceptance of F-scale
items.⁴

Jackson, Messick, and Solley report an item
mean of 3.10 with an SD of .67 on 42 Ss. Their
scoring procedure differed slightly from the
conventional one and yields a mean equivalent
to 3.77. If their distribution of scores was
normal, approximately one or two Ss would
have scored above 5.0. Leavitt, Hax, and
Roche studied two small samples of foremen
(n’s = 6 and 13), two of executives (n’s = 34
and 52), and one of Unitarians (n = 28). A
composite plot of their data (17, Fig. 1, p. 219)
indicates that low scoring Ss greatly
outnumbered high scoring ones. As nearly as we
can determine by inspection, eight Ss had an
item mean agreement varying between “slight”
and “somewhat” and a single S had an item
mean above 6.0. The relatively high scorers in
these samples thus fell largely within or slightly
above the neutral range.
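
The estimate of "one or two Ss" above 5.0 in the Jackson, Messick, and Solley sample can be reproduced with a normal approximation. The sketch assumes, as the text does, that scores were normally distributed, and further assumes that the SD of .67 carries over unchanged to the converted mean of 3.77 (that is, that the conversion only added a constant).

```python
from statistics import NormalDist

mean, sd, n_subjects = 3.77, 0.67, 42

# Proportion of a normal distribution lying above an item mean of 5.0.
p_above = 1 - NormalDist(mu=mean, sigma=sd).cdf(5.0)

# Expected number of such subjects in a sample of 42.
expected_high_scorers = n_subjects * p_above
print(f"{p_above:.3f} of the distribution, about {expected_high_scorers:.1f} Ss")
```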


⁴ The item mean is higher than generally found in
schools of Northwestern’s caliber. Since Chapman
and Campbell did not randomly sample F-scale items
and used but 16 of them, the possibility of bias
in the items was considered. The 14 items taken
from the Form 40 of the F scale had an item mean
(our computations) identical with the other items
in this form in the original samples (based upon
1, Table 9 (VII), p. 260). Their other two items
were taken from the Form 60 and had an item mean
of 4.46 in contrast to that of 3.71 on the whole
scale (1, Table 6 (VII), p. 253). Since there is
reason to believe that the relative “pull” of
F-scale items is fairly consistent in college
samples, a slight contribution to the unexpectedly
high item mean in this sample may be attributed to
the popularity of these two items. The effect of
allowing a “neutral” option upon item mean scores
is unknown.


Additional Comments


It should be emphasized that the preceding
comments about these studies are based upon
an advantage that the original authors did
not have, i.e., the opportunity to be familiar
with one another’s work. The seemingly
paradoxical finding in independent studies
that F-scale reversals did not correlate
appropriately with the F scale raised questions
which led to analysis of possible reasons. Chapman
and Campbell have an insightful discussion of
the problems of reversing items and because
of the difficulties did not try to reverse more
than ten items in their initial attempt (7, p.
129). Since they were the only investigators
to report an item analysis, a study of their
items which did successfully reverse gave us
clues as to the possible failures of their other
items and those of the other investigators.
Further hunches were suggested by Bass’
analysis of reversed items as to the degree of
judged logical difference from the originals,
the strikingly negative correlation found by
Jackson et al. with reversals that were
stylistically similar to F-scale items, and the
different correlations among samples reported by
Leavitt et al.


AN EXPLORATORY INVESTIGATION OF 
REVERSED F-SCALE ITEMS


The present study was undertaken to ex- 
plore the implications of our examination of 
earlier studies. Clarification of the problem of 
the role of response set in the F scale required, 
we believed: (a) reversed items which were 
constructed according to the criteria previously 
specified and (b) the testing of these items on
samples believed to contain a wide range of 
scorers on the F scale. 


The Item Pool 


It was decided that the larger the number of 
reversals attempted, the greater the probability 
of empirically finding ones which were cor- 
related with the F scale. In two studies (7, 17), 
the researchers did not test reversals of all the 
items in the F scale. In three of the studies 
(2, 13, 17), F-scale items were reversed al- 
though they had been explicitly mentioned in 
The Authoritarian Personality as being poor 
discriminators (1, Nos. 22 and 44, pp. 254-257). 
We attempted to reverse every F-scale item 
which was acceptable in its original form. 

The final version of the F scale described in 
The Authoritarian Personality contained 29 
items. We attempted to reverse the 27 which 
were found to be discriminating. One, “wars 
and social troubles may someday be ended by 
an earthquake or flood that will destroy the 
whole world,” baffled us in our attempts to 
construct a meaningful reversal. Three items 
which were in the subsequent Form 60 A
were added to the pool. A fourth additional
item, taken from the earlier Form 60, was also
added. It had discriminated between high and
low scorers (1, Table 6 (VII), item 41, p. 258)
but, for unstated reasons, was not included
in subsequent revisions of the F scale.


The Item Reversals 


After discussion of the problems of item 
reversal along the lines indicated above, the
authors individually attempted to write one or 
more reversals of each item in the pool. These 
reversals were then discussed and one of them, 
or more often a modification of one, was ac- 
cepted for provisional inclusion in the reversed 
scale. The reversals were then circulated among 
colleagues familiar with the F scale; their 
critical comments were solicited and, in most
cases, accepted.


The Subscales 


Two questionnaires (Forms A and B) were
prepared. Each was composed of 15 of the 30
original F-scale items and the reversals of the
other 15 since it was believed that the presence
of the original and reversed form of the same
item in the same questionnaire would lead to
possible artifacts in response. The assignment
of items to the two forms was based upon the
use of a table of random numbers and the
serial numbering of the items within the
questionnaire was also random. There was no
a priori reason for assigning items since earlier
research had indicated that individual items in
the F scale fell into different empirical clusters
in different samples (8). Each scale was a
mirror image of the other, i.e., if a given item
appeared in its original wording on Form A
its reversal was in the same serial position on
Form B. The item reversals used, their allocation
to Form A or B, their position within the
subscales, and the source of the original item
are given in Table 1.
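
The construction of the two forms can be sketched as follows. A seeded pseudo-random generator stands in for the table of random numbers, and the function name, the item identifiers 1 to 30, and the seed are hypothetical; the sketch illustrates the mirror-image design only and is not a reconstruction of the actual allocation given in Table 1.

```python
import random


def build_mirror_forms(item_ids, seed=None):
    """Split items at random into two mirror-image questionnaires.

    Returns (form_a, form_b). Each form is a list of (item_id, wording)
    pairs in the same serial order; an item worded "original" on one
    form is worded "reversed" at the same position on the other.
    """
    ids = list(item_ids)
    if len(ids) % 2:
        raise ValueError("need an even number of items to split in half")
    rng = random.Random(seed)  # stand-in for the table of random numbers
    original_on_a = set(rng.sample(ids, len(ids) // 2))
    order = ids[:]
    rng.shuffle(order)  # random serial position, shared by both forms
    form_a = [(i, "original" if i in original_on_a else "reversed") for i in order]
    form_b = [(i, "reversed" if i in original_on_a else "original") for i in order]
    return form_a, form_b


form_a, form_b = build_mirror_forms(range(1, 31), seed=1956)
print(sum(wording == "original" for _, wording in form_a))  # 15
```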


Scoring 


The customary procedure in scoring the F
scale was followed, respondents being given an
option of agreeing strongly, somewhat, or
slightly or disagreeing slightly, somewhat, or
strongly. No neutral or undecided option was
allowed, but omission of an answer was scored
as neutral. Both scales were scored so that
high scores reflected “authoritarianism” as
indicated in Table 2.
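
A minimal sketch of this scoring rule follows. The 1-to-7 values and the neutral score of 4 for omissions follow the conventional scoring described earlier in the paper; keying a reversal by mirroring its score around the neutral point (8 minus the score) is our assumption about what "scored so that high scores reflected authoritarianism" amounts to, and the names are illustrative.

```python
# Conventional seven-point scoring; no neutral response is offered to respondents.
RESPONSE_SCORES = {
    "agree strongly": 7,
    "agree somewhat": 6,
    "agree slightly": 5,
    "disagree slightly": 3,
    "disagree somewhat": 2,
    "disagree strongly": 1,
}
NEUTRAL_SCORE = 4  # assigned only when an item is left unanswered


def score_item(response, is_reversal=False):
    """Score one response so that a high score reflects authoritarianism."""
    score = NEUTRAL_SCORE if response is None else RESPONSE_SCORES[response]
    # Assumption: rejecting a reversal counts like accepting an original item.
    return 8 - score if is_reversal else score


def item_mean(responses, reversal_flags):
    """Mean item score for one respondent across original and reversed items."""
    scores = [score_item(r, flag) for r, flag in zip(responses, reversal_flags)]
    return sum(scores) / len(scores)


print(item_mean(["agree strongly", "disagree strongly", None], [False, True, False]))  # 6.0
```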

The Samples

We were interested in obtaining samples 
which included Ss who would show great 
variability of scores on the F scale and were 
especially interested in finding high scorers. 
Like other investigators, we were hampered 
by the difficulty of obtaining noncollege sam- 
ples. Through the cooperation of friends, the 
scales were administered to their classes at 
two colleges that were believed to contain a 
higher proportion of high scorers than would be 
found in most colleges with which we were 
familiar. One of these was a private university 
in the Southwest, the other, a Northeastern 
public university. The scales were administered 
during the 1956 summer session when the 
student bodies were more heterogeneous than 
usual since they also contained many elemen- 
tary and high school teachers who were attend- 
ing summer school. 

Ss in each of the two college samples were 
divided in half by administering odd (Form A) 
and even (Form B) scales to adjacent Ss.
Results on four samples will be discussed:
SW-even (n = 75), SW-odd (n = 75), NE-even
(n = 39), and NE-odd (n = 40). Item mean
scores and SDs on the 15-item F subscales were 
respectively: 3.81, .97; 3.39, .93; 3.26, .99; 
3.35, .93. We are unable to explain why the 
SW-odd sample scored so surprisingly much 
lower than the SW-even sample since the 
alternate forms were distributed to adjacent 
students and the two SW samples do not differ 
significantly in proportion of males (roughly 
two-thirds), age, or major in college. Roughly 
half of the NE samples were male. Fourteen 
per cent of both samples were psychology 
majors, 25 per cent were education majors, 
and 19 per cent were business administration
majors. The bulk of the remainder of the NE 
samples was composed of nursing students, 
that of the SW samples of students in engineer- 
ing and liberal arts. Respondents ranged from 
freshmen to graduate students in all samples. 


RESULTS 
Item Analysis 


In analyzing the reversals, scores made by
the Ss on the appropriate F subscale were
divided into quarters. Two comparisons were
then made of the scores on individual reversed
items: the magnitude and direction of the
individual item mean differences of those in
the upper and lower quarters on the F subscales
and the pattern of differences in all four
quarters.

Of the 30 reversals, 22 discriminated in the
appropriate direction between high and low
quarters on the F scale in both samples
responding to them, four in one sample but
not the other (Nos. 7, 11, 14, and 17), and four
discriminated negatively in both samples
responding to them (Nos. 1, 20, 21, and 23).
This differs from chance expectancy at better
than the .01 level as tested by chi square. The
attempt to find workable reversals is thus
viewed as moderately successful.

Individual F-scale items were similarly
analyzed against extreme quarters on the total
scale. All discriminated in the appropriate
direction but there was great variability in
discriminatory power and mean level of
acceptance.

TABLE 1
REVERSALS OF F-SCALE ITEMS

No.  Orig. No.*  Form  Reversed Statement

 1.  31                There are times when it is necessary to probe into even the most
                       personal and private matters.
 2.  37                It would be a good thing if people spent more time thinking and
                       talking about ideas just for the fun of it.
 3.                    If it weren't for the rebellious ideas of youth there would be less
                       progress in the world.
 4.                    When a person has a problem or worry, it is best to face it and try to
                       think it through, even if it is so upsetting that it keeps him from
                       concentrating on other things.
 5.                    Insults to our honor are not always important enough to bother about.
 6.                    There are many difficulties a person cannot overcome no matter how
                       much will power he has.
 7.                    It's nobody's business if someone is a homosexual as long as he
                       doesn't harm other people.
 8.                    It's all right for people to raise questions about even the most
                       sacred matters.
 9.                    It is highly unlikely that astrology will ever be able to explain
                       anything.
10.                    In the long run it is better for our country if young people are
                       allowed a great deal of personal freedom and are not strictly
                       disciplined.
11.                    Even though people of all sorts mix together nowadays, you don't have
                       to worry very much about catching an infection or disease.
12.                    Even if there were no immoral, crooked, or feeble-minded people there
                       would still be many serious social problems.
13.                    An urge to jump from high places is probably the result of unhappy
                       personal experiences rather than something inborn.
14.                    It doesn't make much sense to divide people into groups like the weak
                       and the strong; too many people are strong in some ways and weak in
                       others.
15.                    The findings of science may some day show that many of our most
                       cherished beliefs are wrong.
16.                    People ought to pay more attention to new ideas, even if they seem to
                       go against the American way of life.
17.                    You may dislike a person very much, but the chances are that if you
                       get to know him well you'll have more respect for him.
18.                    One of the most important things children should learn is when to
                       disobey authorities.
19.                    Most people can be trusted.
20.                    You'll do better work in the long run if you do things your own way
                       rather than by following tried and true procedures.
21.                    It is more important for this country to have a just legal and
                       political system than a series of trustworthy leaders, however
                       courageous, tireless, and devoted they might be.
22.                    Sex crimes, such as rape and attacks on children, are signs of mental
                       illness; such people belong in hospitals rather than in prison.
23.                    Many people have too great a fear of plots hatched in secret by
                       politicians.
24.                    It's only natural for people to sometimes have thoughts about hurting
                       a close friend or relative.
25.                    The artist and professor are probably more important to society than
                       the businessman or manufacturer.
26.                    Most honest people admit to themselves that they have sometimes hated
                       their parents.
27.                    Human nature doesn't make war inevitable; man may some day establish
                       a peaceful world.
28.                    In spite of what you read about the wild sex life of people in
                       important places, the real story is about the same in any group of
                       people.
29.                    People should be willing to overlook failures in the manners and
                       unpleasant personal habits in other people.
30.                    Books and movies ought to give a more realistic picture of life even
                       if they show that evil sometimes triumphs over good.

* All items are taken from 1, Table 7 (VII), pp. 255-257 except for No. 16, which was
found in Table 4 (VII), p. [illegible], and Nos. 19, 20, and 30, the originals of which
are in 8, Table 3, p. 464.

An Examination of Response Set 


Although these findings indicate that F-scale
items are, at least within limits, reversible,
they do not clarify the more basic problem as to
how response set is linked with the F scale.
Given the fact that the F scale has been found
to be correlated with a host of measures that
do not involve response set, that its construct
validity is tenuous, and that response set
plays some role, although not the dominant
one, in the acceptance or rejection of its items,
how best to proceed?

Since we had selected our reversed items
against the criterion of scores on F subscales,
we decided to examine further the most successful
five in each form. This procedure eliminated
the negatively discriminating items and
those of marginal discrimination and left only
those found to be systematically related to
response to the F scale.

We also pared down the number of F-scale
items for more intensive scrutiny to five in
each subscale. These were the ones with the
highest discriminatory power in the item
analysis. So doing tended to get rid of not only
those items which were poor discriminators but
those which were widely rejected or accepted.
For example, the item mean on the statement,
“People can be divided into two distinct
classes: the weak and the strong,” was but 1.77
in the SW-even sample and even those scoring
in the high quarter tended to reject this
“authoritarian” item, their item mean score
being but 2.0 (disagree “somewhat”). On the
other hand, the SW-odd sample tended to accept
the statement, “The wild sex life of the old
Greeks and Romans was tame compared to
some of the goings-on in this country, even in
places where people might least expect it,” the
item mean being 5.15. Slightly greater acceptance
than rejection of this statement on
the part of low scorers was found (item mean
of 4.20). The ten reversed items (five from
each form) used in this analysis were Nos. 3,
5, 13, 15, 16, 18, 25, 28, and 30 in Table 1. The
F-scale items were the originals of Nos. 1, 6,
8, 9, 15, 18, 20, 22, 24, and 30.


reversed 


These short five-item scales had a mean
part-whole correlation of .82 with the longer
15-item forms and thus may be viewed as
tapping the essence of the F scale. They are
also useful since they are presumably the most
susceptible to response set if response set is
part and parcel of the “authoritarian” syndrome.
After all, items that are universally
accepted or rejected cannot yield much information
about differential response set among
individuals.

TABLE 2
SCORING OF STRAIGHT AND REVERSED F-SCALE ITEMS

                              Score
Response             Straight F    Reversed F
Agree Strongly           7             1
      Somewhat           6             2
      A Little           5             3
No answer                4             4
Disagree a Little        3             5
         Somewhat        2             6
         Strongly        1             7

TABLE 3
CORRELATIONS BETWEEN LONG AND SHORT FORMS OF
STRAIGHT AND REVERSED F-SCALE ITEMS

                   Scales Being Correlated
            15-item F+    15-item F+    5-item F+
            and           and           and
Sample      F−            5-item F−     5-item F−
SW-odd      .35           .42           .45
SW-even     .14           .27           .23
NE-odd      .34           .27           [illegible]
NE-even     .45           .66           .44


The correlations between these abbreviated
scales and the original 15-item scales are given
in Table 3. Since the items in the short scales
were selected against the longer F subscales, 
it is not surprising that the magnitude of the 
correlations is similar. It is clear, however, 
that the correlations fall far from what would 
be expected if the reversals were adequate and 
response set did not play some role. 


Scores of the 229 Ss in the four samples were grouped
and ordered in terms of their item mean response on 
the F scale. The 58 Ss making scores of 5.0 and over 
were considered to be high scorers and their responses 
to the reversed items were examined. Sixteen of these 
individuals had item mean scores of 6.0 or higher on 
the short F scale. Despite this high degree of “acqui- 
escence” with “authoritarian” items, it did not gen- 
eralize to the reversed items. Their scores here all fell 
in the neutral range (3.2 to 4.8) with the exception of 
one slightly deviant case with a score of 5.0 (disagree 
slightly). There are many ways of balancing off responses
to attain a “neutral” score and these Ss displayed
a most consistent inconsistency in doing so.
They used the extreme categories in answering the
reversed items; 84 per cent of their total responses
were in terms of “somewhat” or “strong” acceptance
or rejection. No one accepted or rejected more than
three or fewer than two of the reversed items. Further,
there was no observable difference in the acceptance 
or rejection of specific items. 
It is clear that an acquiescent response set does not
account for these results. These individuals who
wholeheartedly and consistently endorsed the most
discriminating F-scale items displayed an enthusiastically
random pattern of response to the reversed items. Our
interpretation is that they have a tendency to make
extreme responses and are ideologically somewhat
confused.

Those scoring from 5.0 to 5.8 (“slight” to almost
“somewhat” agreement) on the short F scale differ in
showing extreme variability on the reversals, the item
mean range varying from 1.8 to 5.6. Those at the
extremes of this continuum display quite different
behavior.

Five of these individuals had item mean scores of
over 5.0 on both the F and reversed F subscales, a
higher score on the reversed scale, incidentally, than
any of 16 individuals with scores of 6.0 or over on the
F subscales considered earlier. These are clearly
ideologically consistent individuals and are moderately
fascistically oriented. Their modal response pattern is
that of accepting four out of five items on the F subscales
and rejecting four out of five on the reversed F
subscales. The small n does not suffice for an evaluation
of the intensity of response although there is a
lack of use of extreme categories (there was not a
single “strong” approval of a reversed item). This
fraction of the Ss shows no evidence of response set.

Acquiescent response set is definitely manifested in
a group of 13 Ss with F subscale scores above 5.0 and
reversed subscale scores of 3.0 or below. The modal
pattern was acceptance of four of five items on both the
F and reversed F subscales. The “authoritarianism”
manifested by these individuals on the F scale appears
nonideological in nature and a reflection of a tendency
to agree with incompatible items.

The remaining 24 Ss in this range of scorers show a 
mixed pattern of responses, the mean on the reversed 
F subscales falling between 3.2 and 4.8. Although 
tendencies in the response set direction may be present 
among some of them, it is equally plausible that they 
tended toward a consistency which was masked by the 
analysis of a relatively small number of items.

A similar analysis was made of those scoring low on
the F subscale. There were 21 Ss who disagreed “somewhat”
or greater (scores of 2.0 or below). Thirteen of
these had a general pattern of acceptance on the reversed
items and may be considered consistent lows.
Eight, however, had mean reversed scores in the neutral
range and thus resemble mirror images of our extreme
high scorers. They differ from the latter, however, in
not predominantly utilizing extreme responses to items;
38, 40, and 23 per cent of their responses fell in the
“slight,” “somewhat,” and “strong” agree categories.
There is thus no evidence of response set among these
respondents.

The moderately low scoring Ss who scored from 2.2
to 3.0 on the F subscales can be identified as ideologically
consistent or as subject to response set just as
those with moderately high scores. A total of three Ss
[illegible] a consistent negative response, i.e., they
tended to disagree with items on both subscales.
Eighteen Ss were identified as consistently low since
they accepted the reversed items while rejecting the
[illegible] ones. The remaining 20 Ss in this range had
[illegible] the reversed scales and [illegible].


Within these samples, at least, three general 
findings emerge: (a) there is a much higher 
proportion of ideological consistency among 
low than high scorers, (b) response set is more
prevalent among those who moderately agree 
than those who agree extremely and although 
slightly present among those who moderately 
disagree is not found among extreme disagreers, 
and (c) there is a higher proportion of ideo- 
logical inconsistency on the reversed items 
among extreme high scorers than among other 
groupings. 

The consistency of the low scorers is congruent
with the findings of Leavitt et al. They
combined their groups and correlated the 
original F-scale items used and their reversed 
scale and found a positive correlation in the 
bottom third of the combined samples of +.45 
(the high third on the F scale had a correlation 
on the two scales in the negative direction: 
−.32). Although we have not replicated their
analysis, an inspection of our data indicates 
(as can be easily guessed from the earlier 
discussion of response patterns) that the low 
scorers are responsible for the overall positive 
correlations; the correlation between original 
and reversed items among the high scorers in 
these samples would not differ appreciably 
from zero. 

The greater presence of response set among 
those with moderate rather than extreme 
attitudes is not at all surprising. This is pre- 
cisely what would be expected according to 
Cronbach’s formulation of the conditions under 
which response set is most likely to be found. 
A person sophisticated enough to reject most 
of the F-scale items out of hand has a well 
enough formulated ideology to know where he 
stands and is unlikely to find reversed items 
vague or unfamiliar. An extreme high scorer is 
of different persuasion but his devotion to the 
irrational does not indicate bland acceptance of 
contrary statements, at least in light of current 
theory and knowledge. Although the preceding
point seems exceedingly obvious, it has not
been prominent in recent discussions of
response set. One possible value of the present
findings, limited as they may be, may be the
reiteration of basic considerations about the
relationship between extremity and intensity
of attitudes. These, in conjunction with
Cronbach’s observations, suggest that it is
precisely those persons who do not have strong
attitudes pro or con who should be most
apt to display response set.

The one unexpected finding was the complete
abandon with which the extreme high scorers 
rejected or accepted the reversed items. There 
is considerable discussion in The Authoritarian 
Personality about the irrationality of such 
individuals but no indication that it would take 
such a remarkable form. 


FURTHER STUDIES 


Although the preceding findings appeared
plausible, they were descriptive and left several
matters in doubt. First, how stable were they?
Our procedure of concurrently analyzing items
from separate samples is similar but not
identical to doing an item analysis on an
original sample and cross-validating on a new
one. Further, our results were obtained on
samples of summer school students who can
be viewed as representative of neither the
more usual college samples nor of the general
population. Replication of the findings in
other samples therefore appeared called for.

The second matter of concern was the
unanticipated behavior of the extreme high
scorers on the reversed items. Some confusion
would not have been surprising but the failure
to find a single high scorer who would firmly
reject reversed items was most puzzling. Since
these individuals were key members of our
samples whose responses served as the basis for
the item analysis, it was not clear whether this
might be due to an artifact arising from the
method of item selection, whether it reflected
the possibly aberrant nature of our samples, or
whether it was indeed a characteristic of high
scorers on the F scale.

Data have therefore been collected on eight
additional samples. These, in all candidness,
were “targets of opportunity,” i.e., they were
not systematically selected to test specific
hypotheses about the relationship between
the straight and reversed forms of the F scale.
They are samples made available informally
through the kindness of friends in testing
their classes or in permitting the inclusion of
the scales in ongoing studies directed toward
other ends.



The Scales 

The reversed F scale was composed of Items
3, 9, 13, 15, 16, 18, 25, 26, 28, and 30 in Table 1.
With the exception of Item 26, these are 
identical to those used in the subscale analysis 
in the exploratory study. 

The F-scale items were the originals of Nos.
1, 6, 8, 10, 12, 20, 22, 23, 24, and 27 in Table 1.
They differ from those used in the subscale
analysis in that the originals of Items 9, 15, 18,
and 30 were deleted and substitutions were
made to prevent the presence of a reversal and
its original in the same questionnaire.
Ordering of the items was random. The
customary response options were provided.
All samples received the same 20-item scale
under conditions noted in the description of
each.

The Samples 


1. Undergraduate students in a Northeastern
public university were given the 20-item scale 
during the first week of a course in introduc- 
tory psychology. It was administered by the 
instructor (male) in charge of the classes with 
assurances about the preservation of anonym- 
ity and the use of the results for research. 
The respondents were young (median age of 
18.7 years), 72 per cent were male, and they 
were predominantly lower middle-class in 
background. Sixty-two per cent were freshmen 
and 30 per cent were sophomores. 

2. Students in a graduate seminar in social
psychology at a major Eastern university
filled out the 20-item scale and also were 
required as part of the course to administer it 
to friends and acquaintances. Protocols of 
individuals familiar with the F scale were dis- 
carded. The sample as a whole did not respond 
to the questionnaire under standardized con- 
ditions. It is unusually high in years of formal 
education. 

3. Students in a summer school course in
remedial reading at a prominent Eastern
teachers college filled out the 20-item scale at
the end of a class period. It was administered
by a female introduced by the instructor as
carrying out research. Most of the respondents
were female school teachers from a broad
geographic area and had a median of roughly
15 years of teaching experience. Most of them
had done their undergraduate work in teachers
colleges and were taking the course to maintain
teaching credentials.

4. A nonrandom but fairly representative
sample of registered Washington lobbyists
was given three separate questionnaires to be
filled out and returned after each had been
interviewed about his role as a lobbyist. One
of the questionnaires contained six F-scale
items and four reversals. These were interspersed
with 25 other items constructed by the
senior author which were largely taken from
the writings of Machiavelli. A five-point
response scale which included a neutral or
undecided category was used. The majority
of this sample held law degrees and most had
previously held elective or appointive political
office.⁵

5-8. These samples consisted of senior
medical school students (with one junior
class) in three top-ranking medical schools.
Students in two which were located in the East
had predominantly received their undergraduate
training at the more prestigeful Eastern
liberal arts colleges. Those in a Midwestern
school were largely from the better Midwestern
colleges.

Items from the 20-item scale were randomly
distributed among 20 Machiavellian items
(half of which were worded in the reversed
direction) in an “opinion inventory.” These
and other psychological measures were part of
a lengthy questionnaire devoted to experiences
in medical school. In some instances the
questionnaire was answered in a class setting,
in other cases individuals filled it out at home.⁶


Results 

The data summarized in Table 4 indicate the
consistency with which positive correlations
between the F and reversed F scale were found.
The samples varied in education from beginning
college students to those with eight years
of undergraduate and professional experience
in the finest of our educational institutions,
and from predominantly female elementary and
high school teachers to lobbyists with a bad
case of “Potomac Fever”; some answered the
questionnaires anonymously in class while
others filled them out in conjunction with
personal interviews; the items were sometimes
presented as an individual scale and were
sometimes mixed with other items, and in five
samples were given along with a large number
of unrelated questions; and in general the
customary seven-point scoring system was used,
but in one case five-point response categories
with a neutral option were used.

5 This study of the role and personality attributes
of lobbyists was carried out under the auspices of the
Brookings Institution under the supervision of Lester
Milbrath.

6 The samples were studied as part of ongoing
research on the Sociology of Medical Education by
the Bureau of Applied Social Research at Columbia
University under a grant from the Commonwealth
Fund.



Despite all these variations that might 
possibly affect the nature of the responses 
positive correlations were found in all eight 
samples. When these are combined with the 
findings in the exploratory study it is found 
that in twelve of twelve samples using various 
combinations of these items the results are 
such as to indicate that response set is not 
important enough to produce negative correla- 
tions. 

The other problem that intrigued us—that 
of the behavior of the extreme highs on re- 
versed items—is partially clarified by a closer 
inspection of these data. The distribution of 
responses on the F scale was leptokurtic in the 
undergraduate sample, there being but eleven 
individuals with a mean score of 3.0 or below 
and five with scores of 6.0 or above. The heavy 
concentration of respondents in the neutral 
and slightly accepting range casts some light 
on the very low correlation between the scales 
in this sample since the generalizations made 
earlier about the relationship between mean 
acceptance of F-scale items and acquiescence 
were confirmed. However, of the five high 
scorers, two displayed acquiescent tendencies, 
two behaved similarly to the extreme highs 
discussed earlier, and one was consistent. 

Further evidence that the extreme high was 
not necessarily inconsistent came most strongly 
from the lobbyist sample where there was very 
little evidence of acquiescence. Indeed, we 
found one “perfect” high scorer (within the 
limits of the items used) who strongly accepted 
all six F-scale items and strongly rejected all 
four reversals. Unfortunately, there were not 
enough extreme high scorers among these 
samples to serve as a sound basis for generaliza- 
tions. 

The wide range of reliabilities on both the 
original and reversed F scales and the varying 
correlations between them posed problems in 
interpretation. It should be added that where 
data from other attitude scales were available, 
they indicated that the amount of ideological 
consistency present was not unique to the F 
scale. The Washington lobbyists, for example, 
had the highest internal consistency on the 
Machiavelli scale of nine samples totaling some 
2,000 Ss tested to date. This finding parallels 
their results on the F scale. Similarly, Sample 5,
composed of a particularly aberrant (statistically
speaking) group of medical students,
showed inconsistency similar to that on the
F scale when responding to Machiavellian
items but were very consistent on a measure
of manifest anxiety. Such findings suggest that
samples vary widely in the amount of acquiescence
that they will display in response to items
of an ideological nature.

The investigation of differences from sample
to sample demands a more refined technique
of measuring response set than that used in
the preceding examples. Before pursuing the
substantive problem, it is necessary to indicate
the method used.

TABLE 4
SUMMARY RESULTS ON F AND REVERSED F IN EIGHT SAMPLES

                                                                                      Correlation
                                                  F Scale         Reversed F Scale    between F and
Sample                                      N     M     SD   Rel.   M     SD   Rel.   Reversed F Scalesᵃ
1. Northeastern undergraduates             152   4.54   .84  .52   3.73   .78  .43    .10
2. Graduate students                        57   2.74   .95  .78   2.72   .93  .57    .51
3. Lobbyists                                86   3.05ᵇ  .59  .54ᶜ  3.03ᵇ  .81  .41ᵈ   .55
4. Remedial reading teachers                61   3.43  1.05  .68   3.20   .94  .60    .29
5. Midwestern medical students (seniors)    72   3.50   .69  .34   3.35   .55  .18    .45
6. Midwestern medical students (juniors)    80   3.55   .88  .66   3.47   .70  .26    .23
7. Eastern medical students (seniors)       73   3.38   .92  .75   3.35   .91  .72    .38
8. Eastern medical students (seniors)      121   3.76   .86  .58   3.17   .68  .42    .48

ᵃ All rs were corrected for attenuation.
ᵇ A five-point scale was used in this sample. These figures are comparable to slightly over 4.0 in other samples.
ᶜ Based on six items.
ᵈ Based on only four items.

A METHOD OF MEASURING ACQUIESCENCE

The diagnosis of acquiescence used with the
four exploratory samples was crude and
obscured the interaction of content and response
style. The correlations and reliabilities
indicated in Table 4 suggest widely varying
probabilities of response set in different samples.
These respondents were subjected to
different numbers of items and in some cases
different choices of response categories. In
considering the problem of comparability from
sample to sample, it was decided that it was
simplest to use the number of statements
agreed to on the F and reversed F scales as the
basic data. This convention permitted the
use of the binomial expansion and the ascertainment
of the probability of a given pattern
of response.

Although the technique is basically simple,
we are unfamiliar with examples of its application
and are illustrating its use with two
five-item scales.

There are six possible degrees of agreement 
for each five-item scale. Cross-multiplication 
of the binomial frequency for each response 
category on each scale indicates the chance 
probability in 1024 times of the frequency of 
a given cell in the matrix, e.g., the probability 
of a given individual agreeing with all five 
items in each scale is one in 1024. The ex- 
pected frequency of any given pattern of 
response is indicated in Fig. 1. 
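
The cross-multiplication described above can be sketched as follows. This is a minimal illustration under the chance model the text implies (each item independently agreed to with probability one-half); the function name `expected_matrix` is hypothetical and the code is not taken from the original study.

```python
from math import comb

N_ITEMS = 5  # items per scale, as in the worked illustration


def expected_matrix(n=N_ITEMS):
    """Chance frequencies, out of 2**(2*n), for every response pattern.

    cell[i][j] is the expected frequency of agreeing with i F-scale
    items and j reversed items when every agreement is an independent
    50-50 event (an assumption of this sketch).
    """
    # Binomial frequencies for 0..n agreements on a single scale.
    row = [comb(n, k) for k in range(n + 1)]
    # Cross-multiply the two marginal distributions.
    return [[row[i] * row[j] for j in range(n + 1)] for i in range(n + 1)]


matrix = expected_matrix()
total = sum(sum(r) for r in matrix)  # 1024 for two five-item scales
```

With two five-item scales the cell for agreeing with all five items on each scale has frequency 1, which corresponds to the "one in 1024" cited in the text.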

The second item of information in the matrix is
the discrepancy score between the two scales.
If the two scales are measuring essentially the
same thing (and the means on two sophisticated
samples, graduate students and lobbyists,
as indicated in Table 4 suggest that this
is approximated), then an individual who
agrees with all five F-scale items should tend
to disagree with all five reversed items. A
simple way to check for consistency is to
reverse the number of agreements on the reversed
scale so that agreement with all five is
scored 0 and disagreement with all is scored 5.
Subtraction of the reversed scale score from
the F-scale score gives a measure of content
consistency. A perfectly consistent individual
should have identical scores on both scales.
A person who is indiscriminately accepting
will have a discrepancy score of +5. If he is
recalcitrantly negativistic he will have a
discrepancy score of −5.

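As a small illustration of the discrepancy score just described, the following sketch takes the number of items agreed to on each scale and applies the subtraction. The function and argument names are hypothetical and are not the authors' notation.

```python
def discrepancy(f_agree, rev_agree, n_items=5):
    """Discrepancy score from counts of agreements on the two scales.

    f_agree   -- number of straight F-scale items agreed with
    rev_agree -- number of reversed items agreed with
    """
    if not (0 <= f_agree <= n_items and 0 <= rev_agree <= n_items):
        raise ValueError("agreement counts must lie between 0 and n_items")
    # Reverse the reversed scale: agreeing with every item scores 0,
    # disagreeing with every item scores n_items.
    reversed_scale_score = n_items - rev_agree
    return f_agree - reversed_scale_score
```

A respondent who agrees with everything gets `discrepancy(5, 5) == 5`, one who disagrees with everything gets `-5`, and a fully consistent high scorer (five agreements, no agreement with reversals) gets `0`.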
Fig. 1. The number in the upper right hand corner of each
cell indicates the chance expectancy in 1024 times of a
given pattern. The upper small number indicates the
discrepancy score; the lower one the total F score on
the combined scales. The numbers at the right and
bottom are the summed frequencies for the cells with
identical discrepancy scores along the diagonals of
the matrix.



The next step is to determine the probability
of occurrence of a given discrepancy
score. Summing along the diagonals composed
of cells with the same discrepancy scores gives
the appropriate frequencies. It will be noted
that this distribution is the binomial expansion
of 2^10. The probability of occurrence of a
discrepancy score of +3 or greater is .055
([1 + 10 + 45]/1024). This is as close to the more
customary .05 level as can be obtained and is
sufficient for present purposes. Since the matrix
is symmetric, discrepancy scores of −3 or
lower indicate a tendency to disagree
significant at the same level.
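
The diagonal sums can be checked with a short sketch. It assumes, as the chance model in the text does, that each of the ten items is agreed to independently with probability one-half; `discrepancy_distribution` is a hypothetical helper name.

```python
from collections import Counter
from math import comb


def discrepancy_distribution(n=5):
    """Chance frequency of each discrepancy score for two n-item scales."""
    freq = Counter()
    for f_agree in range(n + 1):
        for rev_agree in range(n + 1):
            # Discrepancy = F score minus the reversed-scale score.
            d = f_agree - (n - rev_agree)
            # Cell frequency is the product of the binomial frequencies.
            freq[d] += comb(n, f_agree) * comb(n, rev_agree)
    return dict(sorted(freq.items()))


dist = discrepancy_distribution()
# Probability of a discrepancy score of +3 or greater under chance.
p_upper = sum(v for d, v in dist.items() if d >= 3) / 2 ** 10
```

The frequencies for scores +5, +4, and +3 come out as 1, 10, and 45, so `p_upper` is 56/1024, or about .055, in agreement with the figure in the text.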

A combination of scores on both scales can
also be used to obtain a measure of overall F
by scoring along the orthogonal diagonal. An
individual who rejects all five of the original
items and accepts all five reversals is clearly
low on the dimension and is given a score of 0
as indicated in Fig. 1. His opposite number
obtains a score of 10. Scores of 2 or less and 8 or
more differ significantly from chance
expectancy.
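
A sketch of the overall F score, again under the assumed 50-50 chance model. The names `overall_f` and `tail_probability` are illustrative and do not come from the original paper.

```python
from math import comb


def overall_f(f_agree, rev_agree, n_items=5):
    """Overall F: agreements with originals plus rejections of reversals."""
    return f_agree + (n_items - rev_agree)


def tail_probability(cutoff, n_items=5, upper=True):
    """Chance probability of an overall F score at or beyond `cutoff`.

    Under the assumed chance model the overall score behaves like the
    number of heads in 2 * n_items fair coin tosses.
    """
    m = 2 * n_items
    ks = range(cutoff, m + 1) if upper else range(0, cutoff + 1)
    return sum(comb(m, k) for k in ks) / 2 ** m
```

For two five-item scales, `tail_probability(8)` and `tail_probability(2, upper=False)` both equal 56/1024, the same .055 level used for the discrepancy scores.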


Acquiescence in the College Trained Population 


A clearer picture of the operation of acqui- 
escence response set may be had by introducing 
data collected on one further sample. This is 
a subsample of the 1956 post election survey 
conducted by the Survey Research Center of 
the University of Michigan. Adult respondents 
selected in a nationwide probability sample 
were asked their education and 103 reported 
having at least some college training. In the 
course of an interview they responded to five 
F-scale items and five reversed items.⁷ The
preceding method of analysis was employed
and it was found that five per cent were
significantly acquiescent, 28 per cent slightly so,
23 per cent were completely consistent, 33 
per cent showed slight negative tendencies, 
and 11 per cent had a significant negative 
response pattern. It is of considerable interest, 
in light of current equation of acquiescence 
with response set, that this sample displayed 
slightly more negativism than acceptance in 
responding to items. The amount of observed 
response set does not, however, differ from 
chance expectancy. 

This finding takes on possible significance 
since graduate students and Washington lobby- 
ists also displayed a negligible amount of 
response set. These are samples which are pre- 
sumably the least confused ideologically. The 
sample which is most distinctive is that com- 
posed of Northeastern undergraduates. A 
technique similar to that indicated in Fig. 1 
was employed for the two ten-item scales they 
answered. The probability of having a dis- 
crepancy score of +4 or greater is .059. 

Thirty-five or 23 per cent of these students 
showed a significant amount of acquiescence 
and five (three per cent) showed significant 
negative tendencies. Although the definition 
of significance level is slightly different for 
these undergraduates and the nationwide
sample, the difference is quite striking. Why
should undergraduates be so much more


⁷ The relationship between education and response
set in the entire sample is currently being analyzed
and will be reported in a separate publication. The
items were slightly modified from the wording indicated
in Table 1 for use with respondents of varying verbal
sophistication. Details of the sampling procedures
and the analysis of the interview material will be
found in the forthcoming book on the 1956 presidential
election by Angus Campbell and Warren Miller. This
study was carried out by the Survey Research Center
with a grant from the Rockefeller Foundation.
acquiescent than a nationwide sample of
adults exposed to college (only two-fifths of
whom were graduates)? A precise answer
would require a detailed investigation of
factors influencing changes over time. It
should be noted, however, that such a finding
is congruent with a variety of data on attitude
changes in the first years of college. Jacob has
summarized a number of studies dealing with
changes on the F scale and other measures
such as favorableness toward civil liberties
and other attitudes indicative of liberalism
(16). The general finding is that students are
most conservative when they enter college.
Exposure to the more liberal norms of college
usually leads to high scorers dropping out of
school, changing in a more liberal direction, or
adamantly holding to their original beliefs
(Newcomb has furnished graphic illustrations
of the last two reactions [21]). The weight of
the evidence indicates that this conflict of
values is most severe during the initial year or
so of college.

Our tentative interpretation is that many 
among these undergraduates—who were in the 
very early days of college—were in a stage of 
value transition and possibly conflict. They 
still tended to accept F-scale items which are 
more widely accepted in noncollege circles 
such as those from which they had just come. 
At the same time, the reversed items were
presumably more in keeping with the value
climate of the university to which they were
adjusting. Many of the students tended to
accept both sets of items. This “acquiescent
response set” does not strike us as a mere
mechanical way of responding; it probably
reflects another facet of Cronbach’s discussion
of the ambiguity of an item. In this case, items
reflecting contradictory views of the world are
accepted because the student has not yet decided
for himself which is the right response;
both are supported by different reference
groups.


DISCUSSION 


Although the preceding digression is slightly
speculative, it has important implications if
valid. An initial question about some of the
studies emphasizing response set focused upon
the actual scale scores of the Ss and the fact
that “high” scorers appeared to be only relatively
high and were in actuality inconsistent
in their responses. Upon the basis of these later
considerations, based, it is true, upon a limited
sampling of undergraduates, it would appear 
that the majority of these studies have used 
subjects who were most liable to display re- 
sponse set of any populations with exposure to 
higher education, i.e., undergraduates at an 
early stage of training. 

There is another obvious implication of the 
relatively high number of acquiescent respond- 
ents in the undergraduate sample. Most of the 
studies involving Ss high and low on the F 
scale use samples drawn from undergraduate 
populations and most usually those in the 
first two years of college. This means, quite 
bluntly, that the high scorers are quite likely 
a mixed group. Some are genuine high scorers 
with a consistent set of authoritarian attitudes 
that involve rejection of reversals as well as 
acceptance of the F-scale items. Others may be 
simply agreeing to statements of this sort and 
the fact that some of them are “authoritarian” 
and others are not is a reflection of their value 
confusion and not necessarily indicative of 
potentially fascist inclinations at the per- 
sonality level. 

The possible confusion that might result 
from treating all individuals with high F-scale
scores as equivalent in a study relating the F 
scale to a measure of intolerance of ambiguity, 
for example, should be obvious. Those who 
are high scorers by virtue of being acquiescent 
are by definition tolerating a considerable 
amount of ambiguity since they are agreeing 
to psychologically incompatible statements! 
That undue reliance on the F scale alone as 
a measure of authoritarianism might be mis- 
leading can be indicated by examining the 
pattern of responses of those undergraduates 
in the Northeastern sample who accepted eight 
or more of the ten F-scale items. Twenty of the 
37 turn out on inspection to be significantly 
acquiescent but not significantly high on the 
overall measure of F, eleven are significantly 
authoritarian but not acquiescent, and seven 
are not significantly high on either measure 
(one individual with a score of 15 on F and 
a +4 discrepancy score was both significantly
authoritarian and significantly acquiescent). A 
virtue of the suggested method of scoring is 
that it gives two scores to an individual—one 
a measure of authoritarianism in which re- 
sponse set is balanced out and one a measure 
of response set. With a large enough number of 
items, it is possible to have overlap on the two 
measures, e.g., two Ss among the 72 per cent
of the graduate student sample who were sig- 
nificantly low on the overall measure of 
authoritarianism were also significantly acqui- 
escent and another two low scorers were 
significantly negativistic. 

It is not news, of course, to point out that 
the F scale is not unidimensional. The present 
argument suggests, however, that there are 
possibly greater differences among high scorers 
than low scorers in the customary undergraduate
samples. When relatively high and
low scorers are compared, differences will be
found when the low scoring Ss are different
from all other groups. This may account for 
many of the correlations found with other 
attitude scales when the generally more 
sophisticated low scorers differ from both the 
consistent high scorers and the confused acqui- 
escent high scorers. The generally murky pic- 
ture that prevails, however, as to the relation- 
ship between the F scale and measures of such 
behavioral tendencies as intolerance of am- 
biguity and rigidity might arise from the fact 
that the low scorers are similar to consistent 
high scorers in having an organized Weltan- 
schauung but are more similar to acquiescent 
high scorers in being able to tolerate certain 
kinds of ambiguity. 

These are speculations and their validity 
can be determined only by empirical studies 
using techniques that clarify the meaning of 
the F-scale scores of a given S. It must be 
emphasized that the majority of the samples 
reported on in this discussion were accidentally 
and not randomly drawn so the only generali- 
zation about the distribution of response set 
which is justified is to the nationwide popula- 
tion of adults with some college training where 
no significant amount of acquiescence was 
found. The presence of some acquiescence 
among college undergraduates raises questions 
about the meaning of some of the results 
found with other samples of undergraduates 
but does not necessarily imply that they too 
are laden with acquiescent individuals. 

It should also be noted that the possibility 
of response set in samples studied by means of 
the F scale does not necessarily indicate the 
relevance of response set to studies where the 
E scale was used as the independent variable. 
A study on response set by Shelley yielded a 
correlation of +.27 (p = .05) between the 
anti-Semitism subscale and the use of “like
much” responses to 60 pictures on the Perceptual
Reaction Test of Berg and Hunt (25).
This is a lower correlation than the majority 
of those cited earlier between the F scale and 
various measures of response set. Prentice 
found 16 favorably worded items toward Jews 
and Negroes correlated less highly (+.78) than
did 16 unfavorable statements which correlated
+1.00 with the E scale. Although we are
in basic agreement with Prentice’s conclusion
that the use of positive or negative items is of
little relevance in the measure of ethnic
prejudice, we note as do Gage et al. that it
does make some difference, a conclusion in
agreement with the earlier research of Campbell
and McCandless (23). Actually, this
slight difference is somewhat surprising since
some of the items used by Prentice violate
standards we have suggested earlier, e.g.,
“No Jews are especially loud, noisy and overbearing”
(23, Footnote 3, p. 420, italics added).

Our interpretation is that in these two 
studies, slight evidence of response set was 
found but, given the difficulties of comparing 
studies using different techniques and samples, 
that the amount of response set is even less 
than that found in studies using the F scale. 
This result is consistent with the general 
thesis that response set is greatest in ambigu- 
ous situations; the use of a minority group as 
a referent in an attitudinal statement is pre- 
sumably less vague for most respondents than 
people in general or the sex life of the ancient 
Greeks and Romans. 

In concluding, the data indicate that acqui- 
escence is not identical with authoritarianism. 
This interpretation is at variance with the 
earlier studies reviewed, a discrepancy here at- 
tributed at least partially to the fact that 
many of the reversed items used in them were 
irrelevant to a meaningful measure of authori- 
tarianism. It also appears that the nature of 
the samples used was a contributing factor 
since the high scoring subjects in many of 
these studies were inconsistent in their 
responses partly because they were drawn 
from populations containing a relatively large 
proportion of individuals who were, although 
probably but temporarily, ideologically con- 
fused. 


SUMMARY 


Four recent studies which have found a lack 
of positive correlations between the F scale 
and scales of reversed items were examined. It
was argued that the attempts at reversals
were often characterized by items that were
not psychologically opposed to the original
items. It also appeared that calling upon the
operation of an acquiescent response set as
an explanation of the results was questionable
since many of the “high” scorers in these
samples were inconsistent in their responses
and were not generally agreeing with F-scale
items.

A new set of items designed to overcome 
some of our objections was developed. An item 
analysis indicated that the majority of the 
reversed items discriminated positively be- 
tween individuals high and low on the F scale 
in two pairs of subsamples. An examination of 
the response patterns on the most discriminat- 
ing items indicated that a minority of the 
respondents showed response set and that 
they were among the individuals who showed 
slight agreement with the F-scale items. 

Further results were obtained on eight additional
samples. All correlations were positive
although there was great variability. A method
for determining the significance of response
set was developed and data from an additional
representative nationwide sample of adults
with a college background indicated no significant
degree of acquiescence. This was also
true of our most sophisticated samples but
appreciable amounts of acquiescence were
found in an undergraduate sample. Possible
implications of this finding for research using
the F scale on undergraduate samples were
noted. It is concluded that the data do not
support the identification of acquiescence with
authoritarianism.
DIFFERENTIAL AUTONOMIC RESPONSIVENESS IN THE PROCESS-
REACTIVE CLASSIFICATION OF SCHIZOPHRENIA¹


GERALD F. KING 
Michigan State University 


CONSIDERABLE research attention has recently
been focused on the relationship
between autonomic nervous
system (ANS) activity and schizophrenic
behavior. Funkenstein and collaborators (9, 
10, 11, 12) have been able to predict prognosis 
to electric shock and other forms of therapy 
with a variety of neuropsychiatric patients on 
the basis of ANS reactivity to certain drugs. 
Confirmatory evidence for the relationship 
between ANS activity and prognosis in 
schizophrenia has been provided in studies by 
Geocaris and Kooiker (14) and Hirschstein 
(15). In the initial method of analyzing ANS 
activity (9), changes in the systolic blood pres- 
sure after adrenergic stimulation (intravenous 
epinephrine) and cholinergic stimulation (intramuscular
mecholyl) were classified into
seven patterns of response. The number of 
blood pressure patterns has been reduced from 
seven to three in subsequent research (23). 
Further modifications have been introduced in 
regard to mecholyl; response to this drug has 
been expressed in quantitative terms, with 
measures of the extent and duration of blood 
pressure changes being employed (13, 24). 

In attempting an integration of physiological 
and psychological processes, Meadow and 
Funkenstein (23) found a relationship between 
the ANS response patterns of schizophrenics 
and impairment in abstract thinking. Such a 
relationship was predictable on the basis of 
earlier studies by Bolles et al. (4) and others,
who concluded that poor prognosis was asso- 
ciated with low performance on tests of ab- 
stract thinking. An investigation by Ficca (8) 
revealed that neuropsychiatric patients with 
“unfavorable” ANS patterns of response, in 
terms of prognosis, produced significantly 
more “‘schizophrenic” signs on the Rorschach 
than patients with “favorable” ANS responses. 


¹ This research was initiated while the author was on
the staff of Veterans Administration Hospital, Battle
Creek, Michigan. Grateful acknowledgment is due
David W. Merrell for his early collaboration, Harry
VanderKamp for providing most of the physiological
measures, and Frank Restle for his interest and suggestions.


In a later study by Meadow and his associates 
(24), in which only mecholyl stimulation was 
used, measurements of ANS activity were ob- 
tained on what might be called a “level of 
autonomic responsiveness” dimension (e.g., 
maximum fall in systolic blood pressure). A 
significant positive relationship was found 
between level of autonomic responsiveness and 
level of abstract thinking with schizophrenic 
subjects. The authors interpret the results as 
suggesting two polar types of schizophrenia, 
with the descriptions corresponding closely to 
the so-called “‘process” and “reactive” syn- 
dromes. The purpose of the present research 
was to explore further the relationship between 
level of autonomic responsiveness and the 
process-reactive classification. 

The following gives a brief delineation of the 
process-reactive concept, which is usually 
viewed as a dichotomy: process schizophrenia 
is indicated by a gradual onset of psychosis 
and a relatively inadequate prepsychotic per- 
sonality, whereas reactive schizophrenia is 
characterized by sudden onset of illness in a 
relatively normal prepsychotic personality. 
Additional criteria are given by Kantor, 
Wallner, and Winder (18). The distinction, 
for a particular patient, is commonly made on 
the basis of personal history information. 

A number of investigators (17, 20, 25, 27) 
have presented evidence showing that schizo- 
phrenics with process characteristics have 
lower rates of recovery than patients more 
reactive in nature. In a study by Kantor, 
Wallner, and Winder (18), the Rorschach 
protocols of process schizophrenics were judged 
as being more “psychotic” than those of reactive
schizophrenics. Brackbill and Fine (6)
found that process schizophrenics could not be
differentiated from organics on the basis of
Rorschach indices of organicity, but that
reactive schizophrenics showed significantly
fewer signs of organic involvement than both
groups. Becker (3), conceiving of a process-reactive
continuum, found a significant relationship
between this continuum and “genetic”
scores derived from the Rorschach. The data 
also revealed a relationship between the 
process-reactive dimension and the conceptual
level of the responses to a proverbs test, but
the relationship was restricted to male patients,
not appearing in females. It can be summarized
that, in comparison with process schizophrenics,
reactives (a) have a more favorable
prognosis, (b) produce Rorschach records
which are less apt to be judged as “psychotic,”
(c) respond with fewer organic signs on the
Rorschach, (d) give responses of a higher
genetic level on the Rorschach, and (e) possibly
interpret proverbs on a higher conceptual
level. The available physiological and psychological
evidence suggests that the process-reactive
concept can be used as a frame of
reference for predicting ANS responsiveness.
The following hypothesis is offered: Predominantly
reactive schizophrenics exhibit a higher
level of autonomic responsiveness after the injection
of mecholyl than predominantly process
schizophrenics. The modifying adverb
“predominantly” makes the hypothesis more
congruent with Becker’s method of analysis
(3). The variety of criteria used in making the
process-reactive distinction permits patients to
display a mixture of both characteristics. What
is involved is a typology of personal history
incidents but not a typology of patients. Thus,
a given patient may fall anywhere on a scale
from “purely process” to “purely reactive.” To
be presented in this paper are two studies
which bear on the hypothesis indicated.


Study I
Subjects


The basic group of Ss consisted of 136 recently
hospitalized neuropsychiatric patients, all being given
the diagnosis of schizophrenia. A variety of diagnostic
subtypes and symptoms were represented. None of the
patients showed any evidence of neurological complications
from routine examination. Ages ranged from 20
to 42 years. All patients had been hospitalized eight
[unit missing in source] or less when the physiological
measures were obtained.


Physiological Measures


The method of securing the physiological measures
of ANS activity was a modification of that used by
Funkenstein, Greenblatt, and Solomon (9). While the
[text missing in source] after awaking in the morning
[text missing in source] resting systolic blood
[text missing in source] intramuscularly,
[text missing in source]
[Figure: systolic blood pressure (mm Hg) plotted against minutes, with the injection of mecholyl marked.]

Fig. 1. Diagrammatic Illustration of Response
of Systolic Blood Pressure to Mecholyl
(Adapted from Meadow et al. [24])
Maximum fall in blood pressure is indicated by X.
Area under baseline of resting blood pressure is indicated by Y.


Two measures, derived from Meadow
et al. (24), were obtained: (a) maximum fall in systolic
blood pressure below the resting blood pressure
following the injection of
mecholyl and (b) area under the base-line (projection
of the resting blood pressure) demarcated by blood
pressure readings following mecholyl. A graphic illustration
of these measures is given in Fig. 1.

The correlation between the two measures of autonomic
responsiveness was found to be .82 (N = 28),
indicating a high degree of commonality. Of the two
measures, maximum fall in systolic blood pressure
(MFBP) was much easier to tabulate, involving merely
the subtraction of the lowest systolic blood pressure
reading from the basal reading. In the present research,
only MFBP was employed as a measure of level of
autonomic responsiveness. Indirect evidence of the reliability
of this measure is indicated by its being positively
correlated with level of abstract thinking in two
independent samples (24).

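As an illustration of how the two measures differ in computational effort, the sketch below derives both from a series of readings. The readings, the time points, and the use of a trapezoidal approximation for the area are all assumptions made for the example; the source does not state how the area was obtained, and the function names are ours.

```python
from typing import Sequence


def max_fall(basal: float, readings: Sequence[float]) -> float:
    """MFBP: basal systolic reading minus the lowest post-injection reading."""
    return basal - min(readings)


def area_under_baseline(basal: float, times: Sequence[float],
                        readings: Sequence[float]) -> float:
    """Approximate area between the baseline and readings below it.

    Uses the trapezoidal rule on the drops below baseline (an assumed
    method); readings above the baseline contribute nothing.
    """
    drops = [max(basal - r, 0.0) for r in readings]
    area = 0.0
    for i in range(1, len(times)):
        area += 0.5 * (drops[i - 1] + drops[i]) * (times[i] - times[i - 1])
    return area


# Hypothetical data: minutes after injection and systolic readings (mm Hg).
minutes = [0, 2, 5, 10, 15, 20]
systolic = [120, 100, 96, 104, 112, 120]
print(max_fall(120, systolic))                      # 24
print(area_under_baseline(120, minutes, systolic))  # 266.0
```

The first measure needs only the single lowest reading, which is consistent with the remark that MFBP was much easier to tabulate.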


Procedure 
A sample of 60 was drawn from the basic group of 
schizophrenic Ss. All Ss were classified on the basis of 
case histories as either “process” or “reactive” by the 
present investigator, using the frame of reference pro- 
vided by Kantor, Wallner, and Winder (18). Inde- 
pendent classification of all Ss was accomplished by 
either one of two other raters. None of the raters had
knowledge of the physiological measures. The percentage
of agreement between the two judgments was
76.7. Analysis of the agreement between the judgments
with a fourfold table yielded a chi square of 17.47,
which is significant at the .001 level of confidence. This
is essentially of the same order of interrater agreement
reported previously (18). Discussion among the raters
revealed that at least two factors probably lowered
reliability: (a) instances of inadequate case histories
and (b) case histories that indicated process and reactive
characteristics to an almost equal degree.

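The agreement statistics can be illustrated with a fourfold table. The agreed classifications (22 process and 24 reactive, reported in the next paragraph) fix the diagonal cells; the source does not report how the 14 disagreements divide between the two off-diagonal cells, so the split of 5 and 9 used below is purely hypothetical. The formula is the ordinary Pearson chi square without continuity correction, which is also an assumption.

```python
def percent_agreement(a: int, b: int, c: int, d: int) -> float:
    """Percentage of cases on the diagonal of the table [[a, b], [c, d]]."""
    return 100 * (a + d) / (a + b + c + d)


def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi square (no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# a, d: agreed "process" (22) and agreed "reactive" (24), as reported.
# b, c: HYPOTHETICAL split of the 14 disagreements; not given in the source.
table = (22, 5, 9, 24)
print(round(percent_agreement(*table), 1))  # 76.7
print(round(chi_square_2x2(*table), 2))
```

With this particular split the statistic falls close to the reported value, but other splits give somewhat different results, so the example should not be read as a recovery of the original table.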
Using only Ss on which there was classificatory
agreement resulted in process and reactive groups consisting
of 22 and 24 Ss, respectively. A comparison of
the groups revealed no significant difference in age.
Further, in considering the process-reactive concept as
a continuum, 16 Ss were randomly selected from the
two groups and ranked accordingly by two independent
raters. Rank 1 was assigned to the S judged to be the
most process in nature, with rank 16 being occupied by
the S being most clearly reactive. Agreement between
the raters is indicated by a rho of .89.
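For readers unfamiliar with the statistic, rho for two untied rankings can be computed from the squared rank differences. The sketch below uses the standard formula; the six-subject rankings are hypothetical and are not the raters' data.

```python
from typing import Sequence


def spearman_rho(ranks_a: Sequence[int], ranks_b: Sequence[int]) -> float:
    """Spearman rank-order correlation for two untied rankings of the same Ss."""
    if len(ranks_a) != len(ranks_b):
        raise ValueError("both raters must rank the same number of subjects")
    n = len(ranks_a)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


# Hypothetical rankings of six Ss by two raters (1 = most process).
rater_1 = [1, 2, 3, 4, 5, 6]
rater_2 = [2, 1, 3, 4, 6, 5]
print(round(spearman_rho(rater_1, rater_2), 3))  # 0.886
```

Identical rankings give a rho of 1 and exactly reversed rankings give a rho of -1, so values near .9 indicate that the raters ordered the Ss in nearly the same way.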


Results 


The mean MFBP score of the reactive schizophrenics
was 24.41 (SD = 8.25), as compared to 18.50 (SD =
7.71) for the process schizophrenics.² The usual t test
gives a value of 2.51, which indicates that this difference
is significant at the .01 level (one-tailed test). For
the 16 Ss, the rho between the sets of ranks on the
process-reactive dimension and MFBP is −.58 (p <
.01, one-tailed test). In accordance with the hypothesis,
the reactive schizophrenics displayed a higher level of
autonomic responsiveness following the injection of
mecholyl than the process schizophrenics. The results
also indicate that it is feasible to order the process and
reactive syndromes on a continuum.
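The t ratio can be approximated from the summary figures alone. The sketch below assumes group sizes of 24 (reactive) and 22 (process), as given in the Procedure, and assumes a standard error formed from the two separate variances; the source says only "the usual t test," so the exact formula used is not known.

```python
from math import sqrt


def t_from_summary(mean_1: float, sd_1: float, n_1: int,
                   mean_2: float, sd_2: float, n_2: int) -> float:
    """t ratio from group means, SDs, and sizes (unpooled standard error, assumed)."""
    standard_error = sqrt(sd_1 ** 2 / n_1 + sd_2 ** 2 / n_2)
    return (mean_1 - mean_2) / standard_error


# Reactive group first, process group second, using the reported summary values.
t = t_from_summary(24.41, 8.25, 24, 18.50, 7.71, 22)
print(round(t, 2))  # close to the reported value of 2.51
```

A pooled-variance version gives a slightly smaller ratio, so the agreement with the reported figure should be taken as suggestive rather than as confirmation of the method.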


Study II
Subjects


With the elimination of the 46 Ss who participated
in Study I, the basic group of Ss was reduced to 90.


Physiological Measures 


The same method as in Study I was employed.


Procedure 


The following modification was introduced in the 
classificatory procedure: all 90 Ss were classified as 
either “process,” “process-reactive,” or “reactive.” The
process-reactive category was designed to be an intermediate
one, representing schizophrenics with both
types of characteristics to an almost equal degree.
Otherwise, the same rating procedure was followed as
in Study I. The percentage of agreement between the
independent judgments was 87.8. The corresponding
chi square, using a 3-by-3 contingency table, was
121.53, which is significant considerably beyond the
.001 level. Groups representing the three categories
were formed from the Ss on which there was classifica- 
tory agreement. Balanced for N, each group consisted 
of 24 Ss. A comparison for age yielded no significant 
differences among the groups. 

In addition, scores on the Elgin Prognostic Scale
(31) were obtained for 22 Ss. This scale consists of
ratings on factors which are based on the process-
reactive differentiation. Data for the ratings were secured
from interviews and clinical folders (case histories,
etc.). Higher scores on the scale represent a
greater incidence of process elements. Wittman (31)
reports an interrater reliability coefficient of .87 for an
N of 61. In the present research, 12 Ss were rated
independently by two raters, the interrater rho being
.94.

² In comparison with the distribution reported by
Funkenstein et al. (13), the MFBP scores obtained in
the present research seem to be depressed.


Results 


Treating the MFBP scores of the three groups by
analysis of variance yielded an F ratio that is significant
beyond the .01 level. The mean MFBP scores were of
ascending magnitude, with the process schizophrenics
being lowest (M = 17.04), the process-reactives intermediate
(M = 22.79), and the reactives highest (M =
26.62). This trend is in accordance with the hypothesis
and the conception of a process-reactive continuum.
Since this is not a post hoc analysis, there is justification
for computing t tests between the individual means.
Using one-tailed tests, the results show that the process
schizophrenics were significantly lower than the process-
reactives (t = 2.39, p < .025), who in turn approached
being significantly lower than the reactive schizophrenics
(t = 1.54, p < .07). The rho between the
Elgin scale and the MFBP scores (N = 22) was −.49
(p < .025, one-tailed test). The findings thus indicate
that the reactive schizophrenics exhibited a greater
autonomic response to mecholyl than the process
schizophrenics; or, in other words, the process-reactive
dimension is inversely related to level of autonomic
responsiveness. As in Study I, the hypothesis is supported.

DISCUSSION


The results provide a basis for discussing 
and speculating about some of the past re- 
search dealing with physiological aspects of 
schizophrenia. Citing a variety of supporting 
studies, Angyal, Freeman, and Hoskins (2) 
and Hoskins (16) characterize schizophrenics, 
in comparison with normals, as showing re- 
duced physiological responsiveness. However, 
confirmation of this position has not been 
obtained in a number of other studies (1, 7, 
26). As suggested by Sands and Rodnick (29) 
and as illustrated by Malmo, Shagass, and 
Smith (22), contradictory results in this area 
are undoubtedly due in part to variations in 
the types of experimental stimulation and the 
nature of the physiological measures. Variation
in the schizophrenic samples employed
in different studies is another likely factor.
According to the present research, responsive- 
ness is related to the process-reactive make-up 
of the schizophrenic sample. If we consider 
pure cases of “process” and “reactive” pa- 
tients for the sake of discussion, the evidence 
points to diminished physiological responsive- 
ness in process but not in reactive schizo- 
phrenia. Thus, reduced responsiveness would 
not be found in studies where the schizo- 
phrenic subjects were predominantly reactive 
in character. 




One possible reaction to the preceding discussion
is that it moves in the direction of a
two-factor theory of schizophrenia. Process
characteristics are associated with a physiological
“impairment,” at least implicitly.
What remains, it would seem, is to tie the disturbance
in the reactive syndrome to interpersonal
factors. This notion is compatible
with the contention of Brackbill (5), who,
after reviewing the studies of brain dysfunction
in schizophrenia, postulates the existence of
central nervous system pathology in process
but not in reactive patients. In this context,
the question arises as to whether autonomic
disturbances are primary or secondary to
cortical defect.

If the information on normal responsiveness
is given more consideration, it is possible to
cast the problem in a different light. Although
the data on the reactivity of normals to
mecholyl are meagre (10, 14), it seems to indicate
that their blood pressure responses are
very similar to what can be expected from
process schizophrenics. In graphic terms, if
level of autonomic responsiveness represents
the ordinate and normals, reactives, and process
schizophrenics lie on the abscissa in that
order, the relationship can be described as
curvilinear. In accordance with this hypothetical
graph, the reactives would be high in
autonomic responsiveness, whereas the normals
and process patients would be relatively low.
Such a relationship offers a more adequate
account of studies which have found that
schizophrenics, especially patients in the
early stages of illness, are physiologically even
more reactive than normals (e.g., 21, 30). A
study by Pfister (28) gives a more complete
illustration of the curvilinear relationship:
Early schizophrenics were hyper-reactive on
cardiovascular measures; if the illness persisted,
however, the same Ss dropped toward
or even below the normal level. Although a
loose interchange of concepts is sometimes
necessary, a good deal of the available data
can be fitted to the curve. A possible explanation
lies in viewing both normals and process
schizophrenics as representing relatively uniform,
stable adjustments over a period of time.
It is suggested that the corresponding physiological
homeostasis is indicated by a low
autonomic responsiveness. A heightened autonomic
reactivity then would reflect a disruption
of homeostasis, which corresponds
with the more transient, precipitous nature of
the reactive adjustment.

Although problems of interpretation are 
posed, the results in general provide additional 
evidence of the meaningful nature of the 
process-reactive classification, or dimension, of 
schizophrenia. Using this concept of schizo- 
phrenia, which cuts across symptomatology, 
evidence has been compiled from this and 
other studies revealing interrelationships
among several classes of variables: case history 
information, psychological dimensions, physi- 
ological measures, and prognosis. The numer- 
ous failures of past studies dealing with 
schizophrenia seem to give this classification 
even more significance. When samples of un- 
differentiated schizophrenics, or even ones 
based on the standard diagnostic subtypes, 
have been employed in research, the findings 
have typically been either negative, not 
readily interpretable, or not reproducible (19). 
Pending the development of a better schema
for delineating the heterogeneous category of
schizophrenia, the process-reactive classification
seems to offer a productive orientation
for research. 


SUMMARY 


This investigation consisted of two studies 
which were concerned with the relationship 
between the process-reactive classification of 
schizophrenia and autonomic nervous system 
activity. On the basis of psychological and
physiological findings, it was hypothesized
that predominately reactive schizophrenics 
display a higher level of autonomic responsive- 
ness after the injection of mecholyl than pre- 
dominantly process schizophrenics. Similar in 
nature, the two studies had the following 
methodological features in common: (a) The 
process-reactive classification of schizophrenic 
Ss was made on the basis of case history in-
formation, (b) a small sample of the total N
was ranked on a process-reactive continuum, 
and (c) maximum fall in systolic blood pres- 
sure constituted the measure of autonomic 
responsiveness. 

In accordance with the hypothesis, the re- 
sults of both studies revealed that the patients 
classified as “reactive” exhibited a significantly
greater fall in blood pressure after the adminis-
tration of mecholyl than the process patients.
Significant rank-order correlations were also 
obtained between the process-reactive dimen-
sions employed in each study and maximum
fall in blood pressure, indicating that it is 
feasible to order the process and reactive 
syndromes on a continuum. Some implica- 
tions of the findings in regard to physiological 
responsiveness in schizophrenia were dis- 
cussed. It was concluded that the process- 
reactive classification seems to provide a 
fruitful frame of reference for research in 
schizophrenia.

AMBIGUITY AND ANXIETY¹


ANDREW S. DIBNER 
Clark University 


THE concept of anxiety has been subject
to considerable scrutiny in the last few
years. The present study is concerned
with the meaning of anxiety, where anxiety 
is considered as a momentary state, not as a 
“personal constant” (14) or as a disease 
process (17). The thesis presented is that 
anxiety is directly related to the degree of 
ambiguity in the situation to which the indi- 
vidual must make some adjustive reaction, 
that it is the condition of uncertainty (of per- 
ception, expectation, success, prediction, con- 
trol, or even survival) which is the essence of 
anxiety, and that the probability of uncer- 
tainty is greater whenever external conditions 
are ambiguous. The ambiguous field permits a 
variety of possible structures (20), offering 
the individual a complex array of choices with 
a lessened probability of correctly predicting 
the results of his decisions because of a lack of 
definitive cues. Cues may be absent, vague, 
and inconsistent or contradictory. A highly 
structured field, on the other hand, provides 
guide lines; the individual may properly eval- 
uate it and behave toward it in a meaningful 
way. There is no implication that ambiguity 
is always a sufficient condition for anxiety; 
the present argument bears only on situations 
containing some element of personal im- 
portance or ego involvement and calling for 
an adjustive response. 

In this paper the terms “ambiguity” and 
“structure” are used to describe both objective 
properties of the stimulus field and psycho- 
logical interpretations of the field. The stim- 
ulus field is said to have greater structure 
according to the extent to which it contains 
sufficient perceivable characteristics to be 
subject to a greater consensus of interpreta-
tions by observers. It is relatively ambiguous
(or unstructured) if its characteristics are less
sharply defined, permitting more alternative
interpretations. Thus, one can diminish the
structure of a stimulus field in an objective
way by subtracting information from it, or
one can add to structure (diminish ambiguity)
by adding information. On the other hand, the
psychological interpretation of the structure
of a stimulus field (Subjective Structure as
opposed to Objective Structure) has as its
determinants not only the structural proper-
ties of the field but also those intrapersonal
attributes which affect perception. Persons
differ in their tendency to impose structure on
an ambiguous field (1, 15, 24), and it seems
probable that even a rather highly structured
stimulus may be perceived as ambiguous by
some.

¹ This paper is based upon a doctoral dissertation
done at the University of Michigan (9). The writer
wishes to express his appreciation to Edward S. Bordin
for his stimulating guidance. Some support was re-
ceived from the Research Project: Analysis of Thera-
peutic Interaction, USPHS M-516, National Institute
of Mental Health, Bethesda, Maryland.

There have been few experiments related to 
such hypotheses about the relationship of 
anxiety to objective and subjective structure 
(7, 13, 28) and none in a natural field situa- 
tion. This study utilized five measures of 
anxiety to test patients’ reactions to psycho- 
logical interviews which were varied in the 
degree of structuring provided by the inter- 
viewers. 


METHOD 


Subjects 


The 40 Ss were male veterans selected from con- 
secutive admissions to the Neuropsychiatric Service of 
the Dearborn Veterans Administration Hospital.² All
cases of suspected psychosis or organic brain disease 
were excluded, as were those involving poor verbal 
ability or low IQ. Selected Ss were all cases of neurosis, 
psychosomatic disease, or psychopathic personality. 
They had to be ambulatory and in sufficient control 
to be on the open wards, relatively new arrivals in the 
hospital, no more than 55 years of age, and without 
previous contacts with the Clinical Psychology section 
before the experimental interview. All were seen within 
one week of admission and were randomly assigned to 
interviewer and conditions. A later check revealed 
equivalence of the groups of Ss receiving Structured or 
Ambiguous interviews on such variables as age (Range 
25-54, Mean 31.7), diagnosis, race, education (Mean 
10.5 years), IQ (Mean 105.4), marital status, number 
of days in the hospital before being interviewed, seda-
tion on the day of the interview, time of day of inter-
view, and the number and type of previous psychiatric
or psychological contacts.

² The writer is indebted to Thomas P. Crane, Man-
ager, and J. J. Lasky, then Chief Clinical Psychologist,
for their help.




Procedure 


Four psychologists³ each conducted clinical
evaluative interviews, five in an ambiguous 
manner and five in a structured manner, thus 
creating a group of 20 Ambiguous (A) and 20 
Structured (S) interviews. The interviews were 
tape recorded and readings were made of the 
patients’ skin resistance throughout the inter- 
views. Following the interviews, the patients 
were asked standardized questions by E, and
completed check lists and card sorts to obtain
their reactions to the sessions. Five anxiety 
measures were compared with an objective 
index of structuredness of interviewer behavior 
and with a measure of patients’ perception of 
structure. 

Conditions. Situational factors of the inter- 
view setting were strictly controlled. Ss were 
not told that the interviews were experimental;
the recording and skin resistance apparatus 
were explained as standard hospital tech- 
niques. The post-interview data collecting, 
however, was justified to the Ss as a research 
study. The interviewers were trained in the 
two conditions. In the A condition, the patient 
was told it was considered best if he talked 
about himself in any way he wished, saying 
whatever he thought would be of help to the 
interviewer in understanding him and his life. 
The interviewer accepted anything brought up 
and supported when necessary, but tried to 
avoid giving any but the most general cues 
as to what to talk about. He tried to avoid 
giving advice or asking follow-up questions. 
In the S condition, the interviewer asked 
questions in predetermined areas and followed 
up any lead the patient offered. He was free to 
make his observations or feelings about the 
material known to the patient in the form of 
questions, information, or counsel whenever 
he wished. 


Measures of Structure 


Objective Structure. The degree of structuring 
by the interviewers was measured by applying 
Osburn’s method (23) to 10-minute samples of 
the typescripts. This index of degree of struc-
ture in an interview is based on a weighted
coding of individual interviewer responses and
on the total number of his responses. The
coding considers interviewer responses in terms
of their direction (information-giving or in-
formation-seeking), referent of the response
(client referent or situation referent) and gen-
erality of the response (most general to highly
specific in content). Since the distribution of
these scores was considerably skewed, it was
necessary to use nonparametric statistics in
the analysis of results.

³ The author is grateful to Richard Cutler, Barbara
Nachmann, Stanley Segal, and David Shapiro for
their cooperation as interviewers.

Subjective Structure. Subjective structure 
was defined by clinical judges’ ratings of the 
degree of ambiguity as seen by the patient. 
Ratings were based on analysis of transcripts 
of patients’ responses to three open-ended 
questions following their interviews. The ques- 
tions asked how the patient felt the interview 
went, how it could have gone better, and if he 
felt he knew what to talk about in the inter- 
view. Instructions to raters follow: 


Following a clinical first interview with a psycholo-
gist, patients were asked about it by the researcher.
Certain questions asked them were designed to elicit
their perception of the degree of structuredness or
ambiguity of the interview situation they had just
faced. Degree of structuredness of the interview as ex-
perienced by the patient is what is being considered now.

The patients had been faced with the task of telling 
the psychologist about themselves. What they should 
or should not tell the psychologist is determined by 
many factors. For example, the degree of defining or 
structuring by the interviewer is a factor. Another 
factor involves the ideas the patient has about what 
is important in his case and what he thinks the inter- 
viewer is interested in. Also the needs the patient might 
have to reveal some things and keep others secret 
would affect his perception of what should be talked 
about. Thus, with some factors coming from within 
the patient, his perception of the structure of the task 
might be different from the perception other patients 
might have facing the same interviewer in the same 
situation. Even though an interviewer may feel he is 
providing many cues as to what the patient should 
talk about, some patients might still feel as if the 
situation is very ambiguous. Others might see their 
interview as cut-and-dried, highly structured, even
though their interviewer doesn’t ask them one specific 
question. They might have a structure of their own 
that they apply. 

The responses to certain questions by nine patients 
are presented here. Please rate for each patient how 
you think he perceived the structuredness of the inter- 
view he had. Rate each interview by placing check 
marks on the 6-point linear scales on the rating sheet. 
You can check between numbers if unsure of the
placement.

1          2          3          4          5          6
Structured.                                 Ambiguous.
He had no doubts at all                     The patient felt he didn’t
concerning what he should                   know what to talk about in
talk about.                                 the interview he had.

Rank intercorrelation of five judges rating nine
patients was .56. E rated all 40 patients. His rank
correlation with the median of the five raters was .90.


Measures of Anxiety 

Palmar Skin Conductance. Resistance read- 
ings were recorded at 30-second intervals by 
the E in a room adjoining the interview room. 
Following an accommodation period of four 
minutes, a curve of conductance was plotted 
for each patient over a 35-minute period. The 
slope of the curve from the mean of the first 
2½ minutes to the mean of the final 10 minutes
was used as the skin conductance measure of
anxiety for the interview. The mean and range
of conductance were equivalent to data re-
ported by others (28, 31). Range of conductance
change through the interviews was from
-180 to 952 micromhos, with mean change
of 148.5.
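
The conductance measure described above can be sketched as follows. This is a minimal illustration and not the author's computation: it assumes resistance readings in ohms taken every 30 seconds after the accommodation period, converts them to micromhos, and takes the difference between the mean of the first 2½ minutes (five readings) and the mean of the final 10 minutes (twenty readings). The function name, the units, and the use of a simple difference for the "slope" are all assumptions.

```python
from statistics import mean

READINGS_PER_MINUTE = 2  # assumed: one reading every 30 seconds


def conductance_change(resistance_ohms):
    """Return the change in conductance (micromhos) from start to end of an interview."""
    # Conductance in micromhos is 1e6 divided by resistance in ohms.
    conductance = [1_000_000 / r for r in resistance_ohms]
    n_first = int(2.5 * READINGS_PER_MINUTE)  # first 2.5 minutes -> 5 readings
    n_last = 10 * READINGS_PER_MINUTE         # final 10 minutes -> 20 readings
    if len(conductance) < n_first + n_last:
        raise ValueError("not enough readings for both windows")
    return mean(conductance[-n_last:]) - mean(conductance[:n_first])


# Hypothetical series: 70 readings (35 minutes) with resistance falling steadily.
example = [100_000 - 500 * i for i in range(70)]
print(round(conductance_change(example), 2))
```

A positive value indicates rising conductance over the interview; a negative value indicates falling conductance.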

Anxiety Cards. Following the interview, E 
presented each S with 20 cards, one at a time, 
each containing phrases describing feelings of 
tension or absence of tension or common 
physiological concomitants of tension. The S 
was asked to place each card on one of three 
piles indicating that he “Felt that way,”
“Was not sure,” or “Did not feel that way.”
Statements on the Anxiety Cards appear
below. The starred items are nonanxious in
character.


1. I felt I couldn’t concentrate too well.
2. I felt I was sweating quite a bit.
3. During most of the talk I felt perfectly at ease.*
4. I felt restless most of the time.
5. I felt my stomach was “turning over.”
6. I kept wondering what this interview was all about.
7. I felt clear-headed all the way through the talk.*
8. After about five minutes I felt relaxed and felt
relaxed till the talk was over.*
9. I felt nervous through the whole talk.
10. The talk left me feeling tired.
11. I felt too hot.
12. I was worried about what I was telling him.
13. I felt pains in my chest sometimes while we were
talking.
14. I had difficulty in breathing sometimes while we
were talking.
15. I felt very comfortable from the beginning of the
talk right through to the end.*
16. Sometimes during the talk I could hear my heart
pounding.
17. Sometimes my throat felt dry while we were talk-
ing.
18. I felt my muscles tighten up sometimes during the
talk.
19. I felt much better during this talk than when I
talk to most doctors.*
20. Most of the time I felt rather tense.


These statements were selected on the basis 
of a pilot study using a longer list of anxiety 
statements. The five Nonanxious statements 
were included with the 15 Anxious statements 
to permit S to profess control of anxiety, but 
only the number of cards bearing anxious 
statements that the patient sorted as “Felt 
that way” yielded his Anxiety Card score. 
The split-half reliability of the scale was .76. 
An item analysis of the anxious cards further 
showed that the scale had high internal con- 
sistency. Obtained scores ranged from 0 to 11 
with a mean of 3.15. 
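
The scoring rule just described can be sketched in a few lines. The sketch assumes the 20 statements are numbered in the order listed, so that the starred (nonanxious) cards are 3, 7, 8, 15, and 19; the pile labels and function name are hypothetical.

```python
# Card numbers of the five starred (nonanxious) statements, assuming the
# statements are numbered 1-20 in the order listed.
NONANXIOUS_ITEMS = {3, 7, 8, 15, 19}
ANXIOUS_ITEMS = set(range(1, 21)) - NONANXIOUS_ITEMS  # the 15 anxious cards

FELT = "felt"          # "Felt that way"
UNSURE = "unsure"      # "Was not sure"
NOT_FELT = "not_felt"  # "Did not feel that way"


def anxiety_card_score(sort):
    """Count the anxious cards placed on the "Felt that way" pile.

    `sort` maps card number (1-20) to one of FELT, UNSURE, NOT_FELT.
    """
    return sum(1 for item in ANXIOUS_ITEMS if sort.get(item) == FELT)


# Hypothetical patient: at ease (card 3) but restless (4) and tense (20).
sort = {n: NOT_FELT for n in range(1, 21)}
sort.update({3: FELT, 4: FELT, 20: FELT, 6: UNSURE})
print(anxiety_card_score(sort))  # 2
```

Card 3 is endorsed but does not count, since only anxious statements contribute to the score; the possible range is 0 to 15.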

Anxiety Ratings. Ratings of anxiety were 
made by clinical judges from transcripts of 
10-minute samples of the interviews. Four VA 
staff psychologists were given 17 10-minute 
samples of transcribed interviews and were 
asked to rate each patient response or 10-line 
segment of long responses on a 5-point scale of 
anxiety. The sample of 17 cases included two 
for each interviewer for each condition, plus 
one randomly selected case. Instructions to 
raters appear below: 

Read through the transcript and rate each patient 
response (or 10-line segment of long responses as indi-
cated). Use the following 5-point scale of Anxiety:

1          2          3          4          5
Very anxious       Anxious       Relaxed, not anxious

Try to consider the scale as linear and continuous 
with 3 as the center-point representing the average 
anxiety level common to patients having their first 
clinical interview. You may check between the num-
bers if unsure of placement of some ratings.

In reading each patient statement try to get a feel 
for his emotional state at the time he is saying those 
words. Content may or may not be important. Try 
to judge mainly by the way he verbalizes the content. 
Here are some suggested cues. Also use that “3rd ear” 
freely to pick up the patient’s emotional tone. 

Some signs of anxiety: 

Fumbling around with how to present thoughts.

Sighing, tense laughter, changes in pitch or volume.

Tripping over words, stuttering, leaving words or 
thoughts half finished. 

Patient stating he is uncomfortable or apprehen- 
sive. 

Angry tone that sounds like reacting to anxiety. 

Some signs of little anxiety:

Smooth running speech, thoughts connected and
uninterrupted.

Statements of confidence that don’t smell of de- 
fensiveness. 

Some responses are very short but try to rate them 
as well as you can. For example: The patient may ask 
the interviewer to repeat (P: What?); from the con- 
text try to get a feel for what is behind this “What?” 
At that point the patient may be anxious or relaxed. 


For each rater, the individual ratings of 
responses within an interview were averaged 
to obtain a single score for an interview. Rank 
intercorrelation of the four raters was .59. One 
of the raters, whose correlation with the me- 
dian of the other three raters was .52, judged 
the rest of the interview samples. 

Because of the possibility of bias from the 
rater’s knowledge of the hypotheses and from 
his judgment as to interviewer structuring (by 
reading interviewer responses), a test of pres- 
ence of bias was made. Three additional raters 
were given 16 of the interview samples with all 
interviewer responses, short patient responses, 
and any patient references to lack of structure 
eliminated. The three raters of these specially 
prepared samples had a rank intercorrelation 
of .60, achieving about the same degree of 
agreement as full sample raters. Their median 
ratings of the 16 interviews correlated .64 with 
the median ratings of the full sample raters. If 
corrected for attenuation due to inaccuracy of 
rating, this correlation would actually ap- 
proach unity. The variance unaccounted for 
by the correlation of .64 is almost entirely a 
function of the unreliability of ratings, leaving 
little room for effect by bias on the part of the 
full sample raters.
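
The attenuation argument can be checked against the standard correction formula, in which the observed correlation is divided by the square root of the product of the two reliabilities. The sketch below assumes, purely for illustration, that the rank intercorrelations of the two groups of raters (.59 and .60) stand in for the reliabilities; the text does not state which reliability estimates were used.

```python
from math import sqrt


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Divide the observed correlation by the geometric mean of the reliabilities."""
    if r_xx <= 0 or r_yy <= 0:
        raise ValueError("reliabilities must be positive")
    return r_xy / sqrt(r_xx * r_yy)


# Assumed inputs: observed r = .64; reliabilities taken as .59 and .60.
corrected = correct_for_attenuation(0.64, 0.59, 0.60)
print(round(corrected, 2))  # 1.08
```

Under these assumptions the corrected value falls slightly above 1, which can happen when the reliability estimates are themselves subject to sampling error. It is in line with the statement that the corrected correlation approaches unity.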

Speech Cue Scales I and II. Two measures of 
clinical signs of anxiety in speech, scored from 
the 10-minute samples of the transcripts, were
employed. Their method of development is re- 
ported elsewhere (8). The scales are based 
upon a count of speech cues reflecting disor- 
ganization, emotional involvement, and de- 
fensiveness. The cues included in each scale 
are as follows: 

Cue Scale I

BR  - Breaking in with a new thought, generally by
      breaking into another sentence.
UN  - Unfinished sentence.
IDK - Saying “I don’t know,” not in answer to a
      direct question, but as in resignation, disgust,
      or despair.
INT - Interrupted sentence. The thought is inter-
      rupted within the sentence but is eventually
      completed in much the same manner as it
      was begun.
R   - Repeating words or phrases when full words
      or phrases are repeated close in sequence
      within a sentence.
ST  - Stuttering when repeating parts of words or
      when leaving a word unfinished.

Cue Scale II

L   - Laughing. Any kind of laugh or chuckle.
V   - Voice change. Change in volume, becoming
      louder or lower.
SI  - Sighing or deep breath.

By inspection of a table of their intercorre- 
lations, the speech cues were grouped into two 
patterns defining the two scales. With the ex- 
ception of IDK, the cues of Scale I are all 
indicators of speech disruption. Scale II is 
composed of cues related to nonverbal voice 
characteristics. The agreement of two judges 
rating the cues grouped in Scale I was 65 per 
cent, the cues grouped in Scale II, 100 per 
cent. Cue Scales I and II have a Kendall’s
rank correlation (tau) of -.03 (p = .81).
Unrelated, they were used independently as 
anxiety measures. 
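
Kendall's rank correlation, used here and throughout the results, can be computed by counting concordant and discordant pairs. The sketch below is a plain illustration with hypothetical scores; it applies the tau-b correction for ties, which is an assumption, since the text does not say how ties were handled.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's rank correlation with the tau-b correction for ties."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    concordant = discordant = ties_x = ties_y = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied on both variables: counted in neither total
        if dx == 0:
            ties_x += 1
        elif dy == 0:
            ties_y += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denom = sqrt((untied + ties_x) * (untied + ties_y))
    if denom == 0:
        raise ValueError("tau is undefined when one variable is constant")
    return (concordant - discordant) / denom


# Hypothetical counts of Scale I and Scale II cues for six interviews.
scale_1 = [4, 9, 2, 7, 5, 1]
scale_2 = [1, 0, 3, 2, 0, 1]
print(round(kendall_tau_b(scale_1, scale_2), 2))
```

A value near zero, as reported for the two Cue scales, means that ordering the interviews by one scale says almost nothing about their order on the other.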

RESULTS 
Structuredness 

Applying the Objective Structure measure 
to the interviews revealed that the interview- 
ers were able to adhere to the conditions. Out 
of the total group of 39 interviews of which 
transcriptions were made, all but two of the 
S interviews had higher Objective Structure 
scores than did the A interviews, and only two 
A interviews overlapped the S distribution. 
Interviewers were found by analysis of vari- 
ance by ranks (16) not to differ significantly 
in adhering to the conditions. Thus, inter- 
views by all interviewers could justifiably be 
pooled for analysis. 

Subjective Structure or the patients’ per- 
ception of structuredness was found to be 
strongly related to the experimental conditions 
(r_bis = .71), but the relationship was consid-
erably lessened when individual variation was
more fully represented by using a rank corre- 
lation with Objective Structure (tau = .35). 
Thus, a patient’s perception of structure is in 
part determined by the structuring behavior 
of the interviewer, but variance appears to be 
introduced by the unreliability of the measures 
and interpatient personality differences which 
affect perception. 


Measures of Anxiety 


Intercorrelations among the five anxiety
measures used as independent criteria appear
in Table 1. Only the correlations between the
Cue scales and Anxiety ratings reach signifi-
cance, and these are low. Both Cue scales and
the Anxiety ratings were based on the same
data, transcribed patient responses. Judges
doing the anxiety ratings were instructed to be
aware of speech clues in their estimate of
patient anxiety. Although the two Cue scales
are themselves uncorrelated, both sets of
speech cues enter into a judge’s estimate of
anxiety when reading transcripts.

TABLE 1

CORRELATION (TAU) OF ANXIETY MEASURES
(40 interviews)

                     Anxiety   Skin Con-   Anxiety   Cue
                     Cards     ductance    Ratings   Scale I

Skin conductance      .14
Anxiety ratings       .00
Cue Scale I           .01
Cue Scale II          .07                             -.03


Ambiguity and Anxiety


A test of the hypotheses of relationship
between anxiety and ambiguity was made by
correlating the measures of Objective and Sub-
jective structure with the five anxiety meas-
ures across all interviews. Table 2 presents
these correlations. In anxiety vs. structure of
the situation as defined by the interviewer
(Objective Structure) all correlations are nega-
tive, and four of the five reach significance,
though all are low. The largest negative rela-
tionship (-.40) is found between structured-
ness of interviewer behavior and clinical judg-
ment of patient anxiety. No significant
relationship was found between Objective
Structure and the autonomic measure of
anxiety (skin conductance).

Similarly, correlations of anxiety to patients’ 
perception of structure (Subjective Structure) 
are again all negative, but only two anxiety 
measures, the patient’s self-report (Anxiety 
Cards) and his change in skin conductance, 
show significant relationships. 

Thus, the hypotheses of negative relationship
between anxiety and structure are borne
out by four anxiety measures in the case of 
Objective Structure and two in the case of 
Subjective Structure. 

TABLE 2

CORRELATION (TAU) OF STRUCTURE MEASURES AND
ANXIETY MEASURES ACROSS ALL INTERVIEWS

                       Anxiety   Anxiety   Skin Con-   Cue       Cue
                       Ratings   Cards     ductance    Scale I   Scale II
Objective structure    -.40**    -.19*     -.05        -.18*     -.23*
Subjective structure   [column order of entries not recoverable]

[The surviving entries for the Subjective structure row are -.03,
-.27**, -.04, -.22*, and -.03; according to the text, the two
significant values belong to Anxiety Cards and Skin Conductance.]

Note. Since direction of relationship was predicted, one-tailed test
was used for significance of correlations.

In these analyses, the total group of experi-
mental interviews was used because the meas-
ures of Objective and Subjective Structure
could be applied to all interviews, ranging 
them on continua. However, the total group 
was, from another point of view, made up of 
two groups toward which the interviewers were 
instructed to behave differently, as to both 
their early orientation of the patient and their 
structuring behavior throughout the inter- 
view. It was thought that a more rigorous test 
of the hypotheses could be attempted by ex- 
amining their utility within the separate A and 
S groups. Such an analysis would eliminate
effects due to the difference in initial orienta- 
tion of patients and the difference in expecta- 
tions of the interviewers. Presumably, within 
each group (A or S) each interviewer tried 
to be the same as he was in other interviews in 
that group, not different, as he had to be when 
he interviewed the next patient under the 
other condition. Thus, a test of the hypotheses 
within probably more homogeneous groups 
would be a purer and stricter test. Because N
was reduced to half, providing a small sample 
of only 20 cases (in some cases 19), and be- 
cause the range of structuredness is reduced, 
it would be expected that error factors would 
have a larger effect in such an analysis, and 
higher correlations would be needed to reach 
significance. By comparing results from the 
analysis of these subgroups with findings 
across all interviews, however, one would be 
perhaps better able to understand the nature 
of variation in the interview situation. 

Table 3 presents correlations of Objective
and Subjective Structure with anxiety within
the experimental A and S groups. All the
relationships are negative within the groups,
except those between Objective Structure and
Skin Conductance and between Subjective
Structure and Anxiety Ratings and Cue Scale
II, all three of which are not significantly dif-
ferent from zero.

TABLE 3

CORRELATION (TAU) OF STRUCTURE MEASURES AND
ANXIETY MEASURES WITHIN EXPERIMENTAL
A AND S GROUPS

Columns: Anxiety Ratings, Anxiety Cards, Skin Conductance,
Cue Scale I, Cue Scale II.
Rows: A Group ranked on objective structure; S Group ranked on
objective structure; A Group ranked on subjective structure;
S Group ranked on subjective structure.

[Cell values are not recoverable from the source, apart from three
entries in the first row (A Group ranked on objective structure):
-.32*, -.10, and -.42**.]

Note. One-tailed test of significance was used unless correlation
was positive.
* p < .05.
** p < .01.

Several points stand out in this table. One is 
that to a great extent the relationships which 
attained significance on the total sample (see 
Table 2) either attain or approach significance
within the separate experimental groups. Sec-
ond, the relationships do not seem equivalent
in the two experimental groups examined
separately. In four out of the five correlations
with Objective Structure, the S group showed 
somewhat larger absolute negative relation- 
ships than the A group. Third, Cue Scale I 
showed comparatively large correlations (— .42 
and —.50) with Objective Structure within 
both subgroups, but when the total sample 
was used, the correlation shrank to —.18. 

The first point, concerned with correlations 
between structure and anxiety within groups, 
is consistent with the hypotheses. The within- 
groups analyses apparently eliminated the 
artificial experimental condition of instructing 
interviewers to behave in a structured or 
ambiguous way. With only the natural vari- 
ation in structuring behavior the interviewers 
planlessly provided, the results are still of a 
kind suggested by the hypotheses. 

As for the second point, one interpretation 
is that interviews in the two experimental 


groups necessarily had different psychological 
atmospheres which were related to the experi- 
mental conditions which distinguished them, 
and that variation in structuring behavior by 
the interviewer subsequently had a somewhat 
different effect, depending upon this more 
general variable. It must be remembered that 
there were only two real differences between 
the A and S groups. One was that the inter- 
viewers were instructed to be unstructured in 
the A group and structured in the S; the other 
was that they told patients early in the A 
interviews that they would not structure it 
for them, repeating this orientation several 
times during the interview if necessary. Since 
one group (A) was given to understand there 
would be little structuring and the other (S) 
had no such orientation, one may hypothesize 
that subsequent variability in structuring be- 
havior by the interviewer was differently inter- 
preted and reacted to by patients in the two 
groups. The finding of somewhat larger nega- 
tive correlations between anxiety and struc- 
ture within the S than the A group is consistent 
with this post hoc hypothesis. Seemingly, pa- 
tients whose interviewers said they were going 
to be unstructured and instead were fairly 
structured showed less anxiety than did pa- 
tients whose interviewers started off with a 
structured approach and wound up being fairly 
unstructured. This is made clearer by con- 
sidering the third point brought out in the 
table, that of rather higher correlations be- 
tween Cue Scale I and Objective Structure 
within groups than when the groups were 
combined. 

Scatterplots of Cue Scale I and Objective 
Structure within A and S groups are presented 
in Fig. 1 side by side. The two cases of A 
whose Objective Structure overlapped the 
range of S interviews are circled. Similarly, 
the two S interviews whose Objective Struc- 
ture was so low as to overlap the range of A 
interviews are circled. The lines connect mean 
points of five intervals in the distributions and 
were inserted to illustrate the regressions. One 
can clearly see the anxiety effects in the S 
interviews in which the interviewers gave com- 
paratively little structuring, and the relatively 
low anxiety effect in the A interviews which 
were moderately structured. When all inter- 
views were pooled for analysis, the correlation 
of Objective Structure and speech disruption 
was lowered by these effects. This interpreta-
tion has a kind of common sense basis in the
notion that any orientation offered to an S
adds to the structure of the stimulus situation
he faces, even the A orientation in which pa-
tients were told at the start of the interview
that the interviewer would do little struc-
turing!
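
The shrinkage of a correlation when two groups are pooled can be illustrated with a small sketch. The data below are hypothetical and are not taken from the study; the function computes Kendall's tau in its simplest form (tau-a, with no correction for ties), which may differ from the exact variant the authors used. Each invented group shows a perfect negative rank relationship between structure and anxiety, yet the pooled tau is much weaker because the more structured group is not correspondingly less anxious.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / number of pairs."""
    pairs = list(combinations(range(len(x)), 2))
    score = 0
    for i, j in pairs:
        product = (x[i] - x[j]) * (y[i] - y[j])
        if product > 0:
            score += 1  # concordant pair
        elif product < 0:
            score -= 1  # discordant pair
    return score / len(pairs)


# Hypothetical scores, invented for illustration only.
structure_a = [1, 2, 3, 4]          # A interviews: low structure
anxiety_a = [4.0, 3.0, 2.0, 1.0]
structure_s = [5, 6, 7, 8]          # S interviews: high structure
anxiety_s = [4.5, 3.5, 2.5, 1.5]

tau_within_a = kendall_tau_a(structure_a, anxiety_a)
tau_within_s = kendall_tau_a(structure_s, anxiety_s)
tau_pooled = kendall_tau_a(structure_a + structure_s, anxiety_a + anxiety_s)

print(tau_within_a, tau_within_s, round(tau_pooled, 2))
```

The sketch only shows that such a pattern is arithmetically possible; it does not establish why it occurred in the present data.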

The possibility of interviewer differences in 
arousing anxiety in the patients was explored 
by using the H test (16) over all interviews. 
None of these tests showed significance. Thus, 
there is no evidence that patients showed 
more anxiety with one interviewer than an- 
other. This factor, therefore, seems unim- 
portant in evaluating present results. 
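
The H test cited here is presumably the Kruskal-Wallis one-way analysis of variance by ranks. A minimal sketch of the statistic follows, assuming midranks for tied scores and omitting the tie correction factor; the example scores are hypothetical and the function name is illustrative.

```python
def kruskal_wallis_h(groups):
    """H statistic for k independent groups (midranks, no tie correction)."""
    pooled = sorted(value for group in groups for value in group)
    ranks = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        # Tied values share the mean of the ranks i+1 .. j.
        ranks[pooled[i]] = (i + 1 + j) / 2
        i = j
    n_total = len(pooled)
    rank_term = sum(
        sum(ranks[value] for value in group) ** 2 / len(group)
        for group in groups
    )
    return 12 / (n_total * (n_total + 1)) * rank_term - 3 * (n_total + 1)


# Hypothetical anxiety scores for three interviewers.
separated = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
interleaved = kruskal_wallis_h([[1, 4], [2, 3]])
print(separated, interleaved)
```

A large H indicates that the groups occupy different regions of the pooled ranking; H is then referred to the chi-square distribution with k - 1 degrees of freedom.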

Similarly, because disproportions in topical 
content during the interviews might influence 
findings on anxiety, an analysis was made of 
topics discussed during the 10-minute samples 
of all interviews. Interviews of A and S types 
were tallied on a classification of topical areas 
suggested by the content and subjected to a chi-
square test. No significant differences emerged.


DISCUSSION 


A methodological factor may have had some
effect on the inconsistent relationships of the
anxiety measures both with each other and
with the two measures of structure. Some of
the measures were “global” ones, derived
either from data collected during the whole
interview or from impressions after the inter-
view was completed; other measures were de-
rived only from data obtained during the 10-
minute samples, which began 20 minutes after
the start of the interview. Higher relationships 
were found, in general, among measures of 
the global type or among measures of the 
time-sample type than when the two types 
were compared. In general, the global meas-
ures of anxiety (Anxiety Cards and Skin Con-
ductance) related better to the global index of
structure (Subjective Structure) than they did 
to the time-sample measure. Similarly, the 
anxiety measures which were derived from the 
time sample (Anxiety Ratings and Cue Scales 
I and II) related better to the measure of 
structure derived from the same sample than 
they did to the global measure of structure. 
There are, however, some significant rela- 
tionships between time-sample measures and 
global measures. More importantly, Objective 
Structure, based on consideration of the inter- 
viewer’s behavior in the 10-minute samples, 
was found to be highly correlated with an 
estimate of his behavior based on the first five 
minutes of the interview (rho = .85), and may 
thus be taken as an adequate estimate of the 
structuredness of the whole interview. There- 
fore, any relationships of other measures to 
Objective Structure may be interpreted as 
meaningful in terms of the whole interview,
not just the 10-minute sample. 

However, these methodological considera- 
tions do not adequately explain the lack of 
correlation among the anxiety measures. The 
two speech measures, both derived from the 
time sample, are uncorrelated. Similarly, Skin 
Conductance and Anxiety Cards, both global 
measures, have no significant relationship. An 
understanding of the lack of correlation of 
measures, each of which has theoretical ra- 
tionale as being an index of that emotional 
disturbance called anxiety, sheds light on the 
nature of the concept of anxiety. The Anxiety 
Cards assess what the patient can or is willing 
to admit about the mental or somatic discom- 
fort he experienced. They reflect tension of 
which the patient is consciously aware, al- 
though, of course, the degree of tension of 
which he is aware may be quantitatively more 
or less than what he indicates in his choice of 
Anxiety Cards. Skin Conductance, on the 
other hand, is one measure over which the 
patient has little conscious control (22). Pre- 
vious studies have claimed both that Ss are 
aware and unaware of changes in skin con- 
ductance (19, 32). Studies generally agree that 
it represents “built up’’ (unexpressed) tension. 
Perhaps there is a threshold of awareness for 
skin conductance changes which varies with 
the individual. The two uncorrelated Cue 
scales represent speech variables indicative of 
“emotionally involved” or disorganized verbal 
behavior. Upon analysis, some of these can 
be understood as direct instances of disor- 
ganization (incomplete sentence or breaking 
in of new thoughts) and others as defenses 
against or attempts to control disorganization 
(repetition or modification of words and 
phrases). Very likely, these cues are not sub- 
ject to the person’s awareness. The Anxiety 
Ratings, which are made by a clinical judge 
from the patient’s verbal productions, can 
also be considered as based on a variety of 
cues of emotional disturbance or defense 
against disorganization of which the patient is 
essentially unaware. 

The suggestion of relationship between 
Anxiety Cards and Skin Conductance (tau = 
.14) may be a reflection of the “aware” ele-
ment of Skin Conductance with the “Aware-
and-willing-to-admit-it” element in the
Anxiety Cards measure. In any case, these 


low correlations force the conclusion that if 
all of these measures do indeed refer to anx- 
iety, anxiety is a combination of “emotional” 
phenomena, each bearing little relationship to 
the other, some being available to the person’s 
awareness while others, perhaps, are not al- 
though they are identifiable by an observer. 
Lacey and Van Lehn (18) and others (6, 21, 
22) offer a theory of emotional reaction in 
which each individual is thought to have his 
own hierarchy of patterns of physiological 
reaction, in opposition to the view (3, 4) that 
certain specific emotional states evoke specific 
reactions in all people. Also, there is support 
from personality studies of psychic defense 
reactions that persons not only have individual 
hierarchies of defenses but that some use their 
defense system over all conflict situations, 
whereas others employ various reaction mech- 
anisms, depending on the nature of the con- 
flict situation and its relevance to the indi- 
vidual’s unique personality dynamics (11, 12, 
27, 29). This approach to the study of per- 
sonality assumes that there are certain reac- 
tion mechanisms which are widely available 
to organisms as a group but which have vary- 
ing salience, perhaps over wide temporal or 
situational spheres or changing with different 
situations, for different individuals. Applying 
this type of consideration to the indices of 
anxiety, one can imagine that each may in- 
deed be a measure of anxiety but that indi- 
viduals differ markedly in their expression of 
tension in speech, in autonomic changes, and 
in awareness of mental and somatic tension. 
Moreover, within the individual, there may be 
changes in the pattern of reaction mecha- 
nisms, depending on his threshold of tolerance 
for tension. Anxiety, then, may be an emo- 
tional state not subject to stable measurement 
by any one behavioral or physiological yard-
stick in any group of individuals, or in any
one individual in different circumstances of 
stress or motivation. 

Thus far, perception of structure has been 
considered as subject to influence by the ob-
jective nature of the situation and by the pa-
tient’s anxiety. Actually, the relationship be-
tween perceived structure and patient anxiety
may be a reciprocal one, the patient’s report 
of structure being influenced to some extent 
by the degree to which he felt threatened by 
the interview. The findings in regard to Sub-
jective Structure and anxiety are more easily
understood in this light. The two anxiety 
measures which showed correlation with Sub- 
jective Structure were Anxiety Cards and 
Skin Conductance. These measures are prob- 
ably most representative of the patient’s 
awareness of tension. Frenkel-Brunswik (10) 
found, in this connection, that those who are
tolerant of ambiguity (who perceive the situa- 
tion as ambiguous if it objectively is) are able 
to tolerate the anxiety involved, while those 
rigid persons who perceptually engraft struc- 
ture on an ambiguous situation do so to avoid 
anxiety. The present findings support Frenkel- 
Brunswik’s in that perception of ambiguity 
was indeed related to “feeling anxious.” The 
findings in relation to Objective Structure, 
however, demonstrate that people may be 
anxious at a level below awareness when fac- 
ing an ambiguous situation. Was such the case 
with Frenkel-Brunswik’s intolerant group? 

Finally, although the goals of therapy most 
often involve reduction of anxiety—not elimi- 
nation, but reduction to a manageable and 
noncrippling level (26, 30)—theorists differ as 
to the value of anxiety as a therapeutic agent.
Analytic theorists depend on anxiety as a 
motivator for progress (2, 25), whereas client- 
centered therapists feel that anxiety reduction 
must be a constant goal “... because only in 
a non-threatening situation the individual’s 
capacity for growth is freed” (5, p. 360). In 
any case, this study demonstrates that inter- 
viewers can deliberately vary the degree of 
structure in the situation. In view of the find- 
ings of a relationship between Objective 
Structure and various types of anxiety, this 
suggests that structuring behavior can be 
planfully used by a clinician as a technique 
for the therapeutic control of anxiety when he 
thinks his patient has surpassed or failed to 
achieve some optimal anxiety level. The re- 
search implications of this possibility are clear 
and important. 


SUMMARY 


Considering anxiety in its meaning as a 
momentary state, not as a personality varia- 
ble, this study tested the hypotheses that 
anxiety is related to the ambiguity of the 
presenting situation as objectively measured 
and as perceived by the subject. 

Forty neuropsychiatric patients in the
Dearborn VA Hospital were given clinical
evaluative interviews by four interviewers. In 
half the interviews, the interviewer gave few 
clues to guide the patient in discussing his life 
and problems (Ambiguous condition). In the 
other interviews, the interviewer took an ac- 
tive part in guiding the discussion (Structured 
condition). All interviews were tape-recorded, 
and a record was obtained of patients’ skin 
conductance throughout the interview. Pa- 
tients were then questioned as to their reac- 
tions to the talks. 

Two measures of structuredness were ob- 
tained: a measure of the structuring qualities 
of the interviewer’s verbal behavior as ana- 
lyzed from the typescripts, and a rating of the 
patient’s perception of the structuredness of 
the interview situation as analyzed from his 
post-interview report. Five different measures 
of anxiety were used: changes in palmar skin 
conductance, ratings of anxiety by a clinical 
judge reading samples of the interview tran- 
script, the patient’s own report of tension by 
means of a modified check-list, and two differ- 
ent sets of indices of disturbed speech. 

Four out of the five anxiety measures 
showed significant relationships with objec- 
tively measured structure, and two out of the 
five showed significant relationships with 
structure as the patient perceived it, partially 
confirming the hypotheses. 

The general lack of correlation of the anx- 
iety measures with each other and their pat- 
terns of relationship to the two measures of 
structure were discussed in terms of the units 
of information on which the measures were 
based, and in terms of the multidimensional 
nature of anxiety response mechanisms and 
the interrelationships of anxiety, perception, 
and personality structure. Findings of the 
study were discussed in relation to a theory of 
anxiety and to psychotherapy. 
THE APPLICATION OF LEARNING THEORY TO THE TREATMENT
OF TICS


AUBREY J. YATES¹
Institute of Psychiatry, Maudsley Hospital, London


SINCE the classic work in 1907 by Meige
and Feindel (19) on the treatment of
tics as examples of learned habits, little
progress in this field has been reported,
although their results were encouraging. In
1932, however, Dunlap (5) claimed that un-
desirable habits (including tics) could be
eliminated by making the subject repeat them
voluntarily and deliberately. But Dunlap gave
few details of his procedure and no experi-
mental demonstration of its validity, merely
asserting that the tics disappeared rapidly.
While his technique of negative practice has
been accepted (at least in so far as the treat-
ment of tics is concerned) in at least one
modern textbook of psychology (1), an ade-
quate rationale and experimental confirmation
of its utility are still lacking.

More recently, attempts have been made to
derive methods for the treatment of behavioral
disorders from the constructs of modern learn-
ing theory (3, 12, 13, 16, 21, 22, 25, 26).³ This
paper reports a rationale for such treatment
in the case of a tiqueur and the results of
experiments carried out in order to test the
validity of the deductions made from the
theory.


THE TIC AS A LEARNED HABIT


The following theoretical model is proposed
to show that some tics can be conceptualized
as learned responses of a particular kind. The
justification for proposing such a model lies in
the deductions which are derived from it con-
cerning treatment.

It is hypothesised that some⁴ tics may
be drive-reducing conditioned avoidance re-
sponses, originally evoked in a highly trau-
matic situation. In this situation, intense fear
is aroused and a movement of withdrawal or
aggression is made. If the movement produces
or coincides with the cessation of the fear-
inducing stimulus, it acquires strength through
reinforcement. On subsequent occasions,
through stimulus-generalization (including
internal symbolization), conditioned fear
(“anxiety”) may be aroused, which is then
reduced by the performance of the movement.
In this way, the tic comes to be elicited by a
large variety of stimuli and eventually achieves
the status of a powerful habit.

Several points should be noticed about this
model. First, although no direct experimental
evidence is available to show that tics do arise
in this way, two lines of investigation suggest
that the formulation is a reasonable one.
Wolpe (26), Solomon and Wynne (24), and
others have demonstrated that animals placed
in a highly traumatic situation develop con-
ditioned avoidance responses which appar-
ently reduce the anxiety associated with the
original situation and which are highly resist-
ant to extinction. Again, clinical investiga-
tions of the history of tiqueurs (8) have led
some workers to suggest that the tic is an
avoidance response arising originally in a
highly traumatic situation, especially in
childhood. Second, the kind of response evoked
may be determined partly at least by the
mode of response characteristic of the subject
in any stressful situation. Malmo and his col-
leagues (17, 18) have presented evidence sug-
gesting that some neurotics react to stress
with cardiovascular responses, whereas others
react to the same stress with muscular re-
sponses. Experiments by Lacey, Bateman,
and Van Lehn (14) similarly suggest that the
nature of the stressful situation is not a major
variable, at least so far as autonomic func-
tions are concerned.

¹ Now at the University of New England, Armidale,
New South Wales, Australia.

² The author would like to thank J. Inglis and H.
G. Jones for helpful discussion on the theoretical part
of the paper, and Linford Rees for permission to carry
out the investigation.

³ Earlier experiments have been reviewed by Hil-
gard and Marquis (9), Crafts et al. (4), and Lehner (15).

⁴ It is not suggested that tics can arise only in the
way hypothesized. Some tics appear to arise from simple
imitation, especially in young children, or as an avoid-
ance response originally arising from some irritating
but nontraumatic stimulus such as a tight collar. The
proposed treatment, however, should be applicable to
any tic which is a learned response.

The suggested model follows closely the
two-factor theory of learning expounded by
Mowrer (21). In terms of Hullian (10) learn- 
ing theory, the reaction potential of the tic at 
a given moment may be conceived as a multi-
plicative function of the habit strength (${}_SH_R$)
of the tic (determined mainly by the number 
of times it has previously been evoked) and 
the momentary drive strength of anxiety (D), 
which fluctuates from time to time. Since 
habit strength increases as a simple negatively 
accelerated positive growth function and 
eventually reaches an asymptote, further per- 
formance of the tic cannot increase its habit 
strength beyond a given point. 
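
The notion of a negatively accelerated growth function that levels off at an asymptote can be sketched numerically. The exponential form and the parameter values below are assumptions chosen for illustration; the text does not specify the growth function or its constants.

```python
import math


def habit_strength(n_evocations, asymptote=1.0, rate=0.1):
    """Negatively accelerated growth toward an asymptote (assumed form)."""
    return asymptote * (1.0 - math.exp(-rate * n_evocations))


# Gain from ten evocations early in learning versus late in learning.
early_gain = habit_strength(10) - habit_strength(0)
late_gain = habit_strength(110) - habit_strength(100)

print(round(early_gain, 4), round(late_gain, 6))
```

Under these assumptions the same number of evocations adds far less strength late in learning than early, which is the property the argument relies on: a long-established tic gains essentially nothing from further practice.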


DERIVATION OF METHOD OF TREATMENT 


According to the model, then, the tic may
be treated as a simple learned habit which has
attained its maximum habit strength. In
terms of the theory, it should be possible,
therefore, to extinguish the habit by building
up a negative or incompatible habit of “not
performing the tic.” If the subject is given
massed practice in the tic, then reactive in-
hibition ($I_R$) should build up rapidly. When
$I_R$ reaches a certain critical point, the patient
will be forced to “rest” or not perform the tic.
This habit (${}_SI_R$) of not performing the tic will
be associated with drive-reduction due to the
dissipation of $I_R$ and hence will be reinforced.
With repeated massed practice, therefore, a
negative habit (“not doing the tic”) will be
built up, incompatible with the positive habit
of doing the tic. Furthermore, the repeated
voluntary evocation of the tic should not
serve to increase the habit strength of the tic,
since it is already asymptotic and consequently
not subject to strengthening by massed prac-
tice.

At any given moment, therefore, the effec-
tive reaction potential of the tic (i.e., the
ability of the patient to perform the tic) will
be a resultant of the forces in the equation

$${}_S\bar{E}_R = ({}_SH_R \times D) - (I_R + {}_SI_R)$$

As ${}_SI_R$ increases, ${}_S\bar{E}_R$ should diminish to a
point at which the excitatory and inhibitory
parts of the equation are equal. The response
tendency of the tic will then be below thresh-
old level, and behaviorally, extinction should
be complete. During this process, of course,
${}_S\bar{E}_R$ would be expected to show fluctuations,
since it is affected by a number of factors
(notably drive level) which are uncontrolled
(${}_SO_R$). Any increase in drive may, of course,
temporarily mask the effects of an increase in
${}_SI_R$. It is interesting to note that Calvin et al.
(2) recently produced evidence to show that
the continued evocation of a habit under con-
ditions of massed practice in rats eventually
leads to its extinction, although the process
may be a slow one.

⁵ With massed practice the tic will also, for the
most part, cease to be reinforced, because, since anx-
iety is only intermittent in S, it will be unaccompanied
by drive reduction.
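
The behavior of the equation above can be traced with a short sketch. All numerical values are hypothetical and in arbitrary units; the function simply evaluates the habit-times-drive term minus the two inhibition terms, and the loop assumes that reactive inhibition has dissipated between sessions while conditioned inhibition accumulates.

```python
def effective_reaction_potential(habit, drive, reactive_inh, conditioned_inh):
    """Excitatory term (habit x drive) minus the two inhibitory terms."""
    return habit * drive - (reactive_inh + conditioned_inh)


habit = 1.0  # habit strength, assumed to be at its asymptote
drive = 2.0  # momentary anxiety drive, assumed constant here

# Conditioned inhibition is assumed to grow across blocks of massed
# practice; reactive inhibition is taken as zero after each rest.
potentials = [
    effective_reaction_potential(habit, drive, 0.0, conditioned)
    for conditioned in (0.0, 0.5, 1.0, 1.5, 2.0)
]

# A rise in drive with inhibition unchanged restores some potential.
masked = effective_reaction_potential(habit, 3.0, 0.0, 2.0)

print(potentials, masked)
```

In this illustration the potential falls to zero once conditioned inhibition equals the excitatory term, and a temporary rise in drive brings it back above zero, which corresponds to the masking effect described in the text.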

The optimum conditions of massed practice 
under which conditioned inhibition (${}_SI_R$) may
be expected to grow most rapidly and effec- 
tively are unknown with respect to the ex- 
tinction of tics. The present experiments 
represent, therefore, an attempt to discover 
these conditions. 

THE PATIENT 

The S was a female psychiatric patient, 25
years old, who was highly cooperative and
apparently had a strong desire to get well.
She was of high average intelligence, and was
markedly neurotic and slightly more extro-
verted than the average normal as measured
by the Guilford-Martin Personality Inventory
of Factors S, T, D, C, R.⁶ Four clear-cut tics
were chosen for the experiment: a complex
stomach-contraction breathing tic, a nasal
“explosion” (expiration), a coughing tic, and
an eye-blink tic. These tics appeared to have
started originally following two very trau-
matic experiences about ten years previously
when S felt that she was being suffocated
while undergoing anaesthesia; she said she
was terrified that she was going to die and
struggled madly. She could not bear the
thought of an anaesthetic mask and could not
tolerate any object’s being placed over her
face. Superficially, at least, these tics seemed
to be conditioned avoidance responses origi-
nally established in a traumatic situation.⁷
She also said she felt a need to do the tics and
experienced relief when they occurred (i.e.,
they were drive-reducing). They varied mark-
edly from time to time in frequency of occur-
rence. It was decided to treat the four tics
concurrently but independently.

⁶ Eysenck (7) has shown that these scales are
reasonably good measures of neuroticism (C and D
scales) and introversion-extraversion (R scale).

⁷ Presumably, escape in such a situation from the
noxious stimulation was effected by the patient’s be-
coming unconscious.

TABLE 1

CHANGES IN MEAN FREQUENCY OF FOUR TICS UNDER CONDITIONS OF VOLUNTARY EVOCATION
(Stomach and eyeblink tics)

         Stomach                        Eyeblink
Sessions     M       σ         Sessions     M       σ
1-50       52.83    4.06       1-50       36.96    4.74
51-100     40.54    3.90       51-100     31.96    2.48
112-150    41.69    2.58       101-150    31.30    1.63
151-175    19.40    2.15       151-200    26.87    2.00
176-200    15.26    1.57
201-220    11.95    1.09       201-220    26.04    1.60
221-285     9.67    1.04       221-285    20.34    1.74
286-315     8.11    6.75       286-315    21.14    1.48
EXPERIMENT I
Sessions 1-100

The purpose of this experiment was to test the
general hypothesis that massed practice leads to a
significant decrement in the ability of S to respond
voluntarily.

Two sessions per day were carried out, each con-
sisting of five one-minute trials for each tic under
conditions of massed practice, with one minute’s rest
between each set of five trials.⁸ S was instructed to re-
produce each tic as accurately as possible and to repeat
it without pause during the practice periods. No stress
was laid on speed. A complete session lasted exactly
45 minutes. One such session was carried out at the
hospital each day under supervision; S carried out the
other on her own. The order of practising the tics was
varied at random. The score recorded was the number
of tics per minute as counted by E in the case of the
nasal and throat tics and by S in the case of the stomach
and eyeblink tics. This procedure was followed for 100
sessions (50 days).

The frequency of occurrence of each tic per minute
(under test conditions) almost invariably showed a
steady decline from trials one to five within a single
session. It was considered that the average of the five
trials per session would give a reliable estimate of the
strength of the tic under conditions of voluntary evoca-
tion in any particular session. The mean score for the
first 50 sessions was compared with the mean score for
the second 50 sessions for each tic separately.

The results (Table 1) are remarkably consistent for
the four tics. The mean score for the second set of 50
sessions is significantly lower than the mean for the
first set of 50 for each tic (values of t were as follows:
stomach, 15.42; eyeblink, 6.60; nose, 11.93; throat,
12.68; p was < .001 in each case). The results, there-
fore, are in accordance with prediction and offer sup-
port for the general theory.
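
The reported t values can be approximated from the Table 1 means and standard deviations for Sessions 1-50 and 51-100. The sketch below assumes an independent-samples formula with 50 sessions in each block; the paper does not state which formula was used, so this is a plausibility check under that assumption and not a reconstruction of the author's computation.

```python
import math


def t_from_summary(mean_1, sd_1, mean_2, sd_2, n):
    """t for two independent blocks of n sessions each (assumed formula)."""
    standard_error = math.sqrt((sd_1 ** 2 + sd_2 ** 2) / n)
    return (mean_1 - mean_2) / standard_error


# (mean, sd) for Sessions 1-50, then (mean, sd) for Sessions 51-100,
# taken from Table 1.
table_1 = {
    "stomach": (52.83, 4.06, 40.54, 3.90),
    "eyeblink": (36.96, 4.74, 31.96, 2.48),
    "nose": (45.56, 5.41, 35.11, 3.02),
    "throat": (42.00, 4.17, 32.18, 3.55),
}

t_values = {tic: t_from_summary(*stats, n=50) for tic, stats in table_1.items()}

for tic, t in t_values.items():
    print(f"{tic}: t = {t:.2f}")
```

Under these assumptions the computed values fall within a few hundredths of those reported in the text, the small differences presumably reflecting rounding of the tabled means and standard deviations.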


EXPERIMENT II
Sessions 101-150


Three hypotheses were tested over Sessions 101-150.
Hypothesis I. It seemed reasonable to contend that
further massed practice under standard conditions
would produce a further decline in the tic frequency.
To test this hypothesis, the eyeblink and throat tics
were practised under standard conditions.

⁸ Designated henceforward the standard procedure.

TABLE 1 (continued)
(Nose and throat tics)

          Nose                          Throat
Sessions     M       σ         Sessions     M       σ
1-50       45.56    5.41       1-50       42.00    4.17
51-100     35.11    3.02       51-100     32.18    3.55
112-150    32.06    3.13       101-150    31.52    3.53
151-180    23.71    2.73       151-200    23.91    2.25
201-220    18.97    4.25       201-220    20.27    3.33
221-248    13.23    2.12       221-285    15.51    2.10
286-315     8.05    1.90       286-315    12.88    1.73

Hypothesis II. The difference between means of the
first and second sets of 50 sessions in Experiment I
might be due to I_R not completely dissipating between
sessions. To test this hypothesis, practice on the
stomach tic was discontinued for 11 sessions; during the
ensuing Sessions 112-150, standard conditions were
resumed. If the decline in Experiment I were due to
I_R alone, the mean for Sessions 112-150 should not be
lower than that for Sessions 1-50, i.e., recovery of
function should take place.

Hypothesis III. To test the notion that the growth
of sI_R would be facilitated by increasing the duration
of massed practice, the nasal tic was practised for 15
min. daily without a break under instructions to keep
the amplitude as strong and as constant as possible.
This procedure apparently had the desired effect of
increasing the number of involuntary rest pauses, a
crucial factor in the growth of sI_R, since in the last
few minutes of the period, the rate of responding
typically dropped to 3-4 per minute. Five sessions
(one per day) were carried out in this way, the
prolonged period always following practice on the other
tics to avoid interference with them. No standard
practice on the nasal tic was carried out at this time.
During Sessions 112-150, standard conditions were
imposed. It was predicted that the mean for Sessions
112-150 would be significantly below that for Sessions
51-100.

The results of this experiment are reported in Table
1. Hypothesis I was not confirmed. For both eyeblink
and throat tics, the mean frequency per session for
Sessions 101-150 was not significantly below that for
Sessions 51-100 (for the eyeblink, t = 1.570, p > .05;
for the throat, t = 0.927, p > .05). However, no
recovery of function took place, since in both cases the
mean for Sessions 101-150 was significantly lower than
that for Sessions 1-50 (for the eyeblink, t = 7.982,
p < .001; for the throat, t = 13.563, p < .001).

Hypothesis II was not confirmed. The mean for
Sessions 112-150 was significantly lower than the mean for
Sessions 1-50 (t = 14.928, p < .001). However, the
mean for Sessions 112-150 was not significantly
different from the mean for Sessions 51-100 (t = 1.589,
p > .05).

Hypothesis III was confirmed. The mean for
Sessions 112-150 was significantly lower than the mean
for Sessions 51-100 (t = 4.655, p < .001). Although
the decline was not very large, it should be remembered
that only five sessions of increased massed practice
were given.
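
For the nasal tic the two sets differ in size (50 sessions against the 39 sessions of 112-150), so a check against Table 1 needs a formula that allows unequal n. The sketch below assumes Student's t with pooled variance, which the text does not confirm; the means and standard deviations are those listed in Table 1.

```python
from math import sqrt


def pooled_t(m1, sd1, n1, m2, sd2, n2):
    """Student's t with pooled variance, computed from summary statistics."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / standard_error


# Nasal tic: Sessions 51-100 (50 sessions) vs. Sessions 112-150 (39 sessions).
t_nasal = pooled_t(35.11, 3.02, 50, 32.06, 3.13, 39)
```

Under this assumption the result is close to the reported 4.655; any small gap may reflect rounding in the tabled means and standard deviations.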

During this experiment, therefore, S seemed to be
on a plateau with respect to three of the tics, but
recovery of function did not take place, even when
practice was entirely discontinued for a week. Further, it
was apparent that the growth of sI_R could be
facilitated in some degree by increasing the amount of
massed practice.

EXPERIMENT III
Sessions 151-200 

Three hypotheses were again tested over Sessions 
151-200. 

Hypothesis I. Practice of the eyeblink and throat 
tics was continued under standard conditions on the 
assumption that during Sessions 50-150 S had been 
on a plateau with respect to these tics; it was pre- 
dicted that further practice would eventually induce a 
decline. 

Hypothesis II. It seemed probable that within the
standard practice conditions, the intensification of the
response would increase the rate of decline. Dunlap (5)
stressed the importance of instructing S to pay careful
attention to the tic. S was, therefore, asked to reproduce
the stomach tic as precisely and as intensively as
possible. The conditions of practice were otherwise
standard.

Hypothesis III. The prolongation of the period of
practice for the nasal tic was continued. However, the
method was varied. Each day, at the end of the hospital
session only, S did 15 min. massed practice of the nasal
tic as in Experiment II. In the second session of that
day, the nasal tic was not practised (since the I_R
associated with the 15-min. practice period might not have
wholly dissipated and could affect performance).
However, at the beginning of the hospital session each day,
the nasal tic was practised under standard conditions.
This served as a measure of the influence of
the previous day’s intensive practice. The standard
nasal practice was followed by the throat and eyeblink
tics, which always preceded the stomach tic to
avoid the effects of fatigue induced by that tic under
intensive conditions. In this way, at least 12 hours
elapsed between the intensive and the standard nasal
practices. During this experiment, there were 20
intensive and 30 standard practice periods for the nasal
tic.⁹

The results, as shown in Table 1, supported
Hypothesis I. The mean for Sessions 151-200 was significantly
lower than the mean for Sessions 101-150 for both
eyeblink and throat tics (for the eyeblink, t = 12.137,
p < .001; for the throat, t = 9.931, p < .001).

With regard to Hypothesis II, the results obtained
with the stomach tic were not directly comparable
with those obtained in Sessions 1-150 because of the
altered instructions within the standard conditions.
When, however, Sessions 151-175 were compared with
Sessions 176-200, a significant decline was apparent in
the latter set (t = 7.793, p < .001). The hypothesis
may, therefore, be regarded as supported.


⁹ The numbers are unequal because S was later
allowed to do the 15 min. practice on her own on
weekends.


Hypothesis III was also supported. Extensive massed
practice led to a striking decline in frequency, the mean
for Sessions 151-180 being significantly below that for
Sessions 112-150 (t = 11.608, p < .001).

The results of Experiment III suggested, therefore,
that the standard procedure led to a continuing decline 
in frequency, that extensive massed practice had a 
very striking effect on frequency of response, and that 
increasing the intensity of response within the standard 
situation also led to a decline in frequency. 


EXPERIMENT IV 


Sessions 201-220 


The purpose of Experiment IV was to test more
rigorously the hypothesis that the decline in frequency
of the four tics was a function of the growth of sI_R
rather than of the accumulation of I_R. When practice
on the stomach tic was discontinued in Experiment II,
S later reported that she had been unable to inhibit
it during the 15-min. practice of the nasal tic. Hence,
the earlier experiment seemed inconclusive. S was,
therefore, given a two-week holiday during which no
practice of any kind was carried out. It was predicted
that no recovery of function would take place.
Following this interlude, the four tics were practised under
standard conditions only, for 20 sessions.

The results, as shown in Table 1, indicated that the
eyeblink tic showed no change following complete rest
(t = 1.00, p > .05). The other three tics all showed
a significant further decline during Sessions 201-220
compared with the level reached in Experiment III
(for the stomach, t = 8.37, p < .001; for the nose,
t = 3.98, p < .001; for the throat, t = 4.49,
p < .001). The decline was particularly striking in the
case of the nasal tic. The hypothesis that no recovery
of function would occur seemed, therefore, to be
supported, but the significant decline following rest in the
case of three tics remained to be explained.


EXPERIMENT V 

Sessions 221-275 

Three hypotheses were tested in this experiment. 

Hypothesis I. Massed practice under standard
conditions was continued for the eyeblink and throat tics,
the prediction being that a further significant decline
would result.

Hypothesis II. Massed practice under standard con- 
ditions but with intensification of the response was 
continued for the stomach tic, the prediction being 
that a further significant decline would result. 

Hypothesis III. It was decided to test whether the
rate of growth of sI_R could be significantly increased
by an increase in the rate and intensity of repetition of
the nasal tic. To test this hypothesis, the 15-min.
period of massed practice was used. This period was
divided into periods of 5 seconds of massed practice
followed by 10 seconds of rest. During the 5 seconds,
S performed the nasal tic as rapidly and as intensively
as possible. It was expected that I_R would accumulate
more rapidly and would dissipate less between
individual tics. Because 10 seconds of rest would thus be
expected to have a high reward value, sI_R should be
more strongly reinforced. Apart from the change in
procedure for the 15 minutes of practice, the method
adopted was identical with that described in
Experiment III.


[Figure: panels including STOMACH and EYEBLINK; vertical
axis, FREQUENCY; horizontal axis, SESSIONS]

Fig. 1. Changes in Frequency of Four Tics Voluntarily
Evoked under Various Conditions of Massed Practice

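
The arithmetic of the Experiment V schedule for the nasal tic can be sketched as follows. The sketch assumes that the 5-second practice bursts and 10-second rests tile the 15-minute period exactly, with no additional pauses; the text does not say so explicitly.

```python
def practice_schedule(total_minutes=15, work_seconds=5, rest_seconds=10):
    """Split a practice period into alternating practice/rest cycles."""
    total_seconds = total_minutes * 60
    cycle_seconds = work_seconds + rest_seconds
    cycles = total_seconds // cycle_seconds
    return {
        "cycles": cycles,
        "practice_seconds": cycles * work_seconds,
        "rest_seconds": cycles * rest_seconds,
    }


schedule = practice_schedule()
```

On this reading the period holds 60 cycles, that is, 5 minutes of actual practice and 10 minutes of rest.
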

The results, as shown in Table 1, indicated that all
three hypotheses were confirmed¹⁰ (for the eyeblink,
t = 13.67; for the stomach, t = 8.26; for the nose,
t = 5.21; for the throat, t = 6.16; p was < .001 in each
case).

EXPERIMENT VI 
Sessions 276-315 


The purpose of this experiment was to test the
hypothesis that a very prolonged period of massed
practice followed by a prolonged rest would lead to a
significant decrement in response. It had been found that
in Sessions 201-220 a highly significant decline took
place on three of the four tics following two weeks’
rest, the decline being most striking in the case of the
nasal tic. The possibility arose, therefore, that the
crucial set of conditions for increasing sI_R rapidly lay
in a combination of very intensive practice followed
by a long rest. In general, it was hypothesized that
optimum increments of sI_R would be obtained if
sufficient massed practice were given to allow the
accumulation of a maximum amount of I_R, followed by a
rest period sufficient to allow the complete dissipation
of I_R. To test this hypothesis, four one-hour sessions
were given, during which uninterrupted massed practice
of the nasal tic was carried out. The other tics were
not practised during these sessions, each of which was
separated from the next by a three-day interval. There
were no standard practices of the nasal tic during this
period. The other tics were practised under the usual
conditions during the second daily session only (i.e.,
once per day, constituting Trials 276-285). At the
end of this period, the patient was given a three-week
holiday during which no practice of any kind was
carried out. Following this rest, 30 standard sessions
(286-315) were given for all four tics.


¹⁰ Table 1 shows the means for Sessions 221-285 for
eyeblink, throat, and stomach tics. Sessions 276-285
were, however, actually performed for these tics during
Experiment VI while the nasal tic was being given
four one-hour practices.

The results, as shown in Table 1, indicate that the
nasal tic showed a very significant decline in the mean
for Sessions 286-315 compared with Sessions 221-248
immediately preceding the four one-hour practices
(t = 7.99, p < .001). Throat and stomach tics
showed a significant but much smaller decline (for
the throat, t = 6.43, p < .001; for the stomach,
t = 8.32, p < .001). The eyeblink tic showed a
small but significant recovery (t = 2.32, p = .02).
Thus, the hypothesis was generally supported.

At the conclusion of this experiment, the mean
score for the last 30 sessions (286-315) was compared
with the mean score for the first 30 sessions (1-30).¹¹
The percentage decrement in the frequency of
voluntary responding for each tic was as follows: nasal, 84.14;
throat, 68.83; stomach, 63.42; eyeblink, 51.47. The
course of decline is shown in Fig. 1, in which each
point represents the average of five sessions (i.e., 25
one-min. trials). Gaps in the curves indicate rests or
changes in treatment.
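
The percentage decrement is presumably the fall from the first-30 mean to the last-30 mean, expressed as a percentage of the first-30 mean; that formula is an assumption, as the text does not spell it out. The means for Sessions 1-30 are not reported, so the sketch below only back-calculates what they would have to be, using the Sessions 286-315 means from Table 1. The implied values are illustrative, not reported data.

```python
def percent_decrement(first_mean, last_mean):
    """Percentage fall from the first-block mean to the last-block mean."""
    return 100.0 * (first_mean - last_mean) / first_mean


def implied_first_mean(last_mean, decrement_percent):
    """First-block mean implied by a last-block mean and a reported decrement."""
    return last_mean / (1.0 - decrement_percent / 100.0)


# Sessions 286-315 means (Table 1) and the reported decrements.
nasal_first = implied_first_mean(8.05, 84.14)
throat_first = implied_first_mean(12.88, 68.83)
```

Under this formula the implied first-30 means come out somewhat above 50 tics per minute for the nasal tic and a little over 41 for the throat tic.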


CHANGES IN INVOLUNTARY TICS 


It is notoriously difficult to obtain a reliable 
estimate of the rate of involuntary responding 
in tics, which fluctuates markedly from day to 
day, even from hour to hour. While a change 
in the rate of response would be very difficult 
to assess reliably, a complete cessation of in- 
voluntary responding outside the test situa- 
tion would be apparent to S, to her acquaint- 
ances, and to E. S had not reached this clinical 
criterion at the end of Experiment VI. At the 
end of the series, however, she was asked to 
write a frank account of her experience over 
the period of treatment, both in relation to 
the tics and to her general adjustment. 

Her report indicated that the nasal tic “has
almost vanished. It returns occasionally ...
every now and then ... but definitely
it does not plague me every few minutes
as before. It is absent for days on end.” The
eyeblink and throat tics were reported as
sharply reduced in frequency. The stomach
tic, however, had changed very little. She also
reported that she had “improved in all
departments generally,” was “able to relax in a
bus or train, able to read or execute a
crossword puzzle.” There was continual reference
to the fact that not doing the movement did
not leave her frustrated. She concluded that
she was, “not tired of the treatment, if
anything now that I have a taste of success the
desire is greater.” She also pointed out that
“one of the most striking features is that my
closest friends notice the change.” The specific
and general changes were confirmed by her
physician.¹²



¹¹ For the stomach tic, the comparison was between
Sessions 151-180 and 286-315.

¹² During these experiments S received no other
treatment but saw her physician for a few minutes
about once a month.


DISCUSSION 


The results obtained in these experiments 
support the theory proposed to explain the 
genesis of certain tics and the deductions made 
from it regarding a rational method of treatment.
Within the test situation there was
objective evidence of a striking decline in the
rate of voluntary responding for all tics. The 
rate of decline was most striking in that tic 
(nasal) which received the largest amount of 
massed practice. There was no evidence of any 
recovery of function following prolonged rest. 
The data further suggest that the optimum 
conditions for the production of conditioned 
inhibition may be a period of very prolonged 
massed practice, leading to the accumulation 
of large amounts of reactive inhibition, fol- 
lowed by a very prolonged rest to allow for 
the total dissipation of reactive inhibition. 
There was subjective evidence suggesting that 
conditioned inhibition generalized beyond the 
test situation itself and that most generaliza- 
tion occurred with respect to the tic which had 
received most massed practice. It is not sug- 
gested, however, that the results outlined 
above can be unequivocally interpreted. Under 
clinical conditions of testing and with limited 
time available, perfect experimental control 
and balance of design proved impossible to 
obtain. Thus, it is possible that the decline in 
Sessions 286-315 in the nasal tic was a func- 
tion entirely of the amount of massed practice 
given and not of the combination of practice 
and rest. Although no standard practices were 
carried out on the nasal tic between the four 
one-hour periods, S reported that she became 
much worse outside the test situation during 
this time. However, during the three weeks’ 
rest period which followed, she reported that 
she rapidly lost the tendency to perform the 
tic involuntarily, which suggests that the rest 
was an essential part of the treatment. These 
results imply that the cumulative reinforce- 
ment associated with the dissipation of reac- 
tive inhibition may continue for longer periods 
of time than is commonly supposed. This 
interpretation is consistent with the findings 
of Jahnke and Duncan (11). 

A number of considerations which arose
during the experiments merit brief mention.
It would have been useful to have left one
tic untreated and to have taken control
readings under standard conditions at regular
intervals. Under these conditions, little or no
conditioned inhibition would be expected and
the prediction would be that the untreated
tic would not decline during the course of the
experiment. Second, an objective measure of
the frequency of each tic outside the test
situation at the beginning and end of the
experiments is highly desirable. It should not be
too difficult to measure the frequency of tics
reliably in a natural situation. Third,
attention should be paid to the amplitude as well
as the frequency of the tics; a measure of both
variables would be useful. Fourth, S should
be instructed from the start to reproduce the
tic as accurately as possible in order to
facilitate the growth of I_R.

Nevertheless, certain features of the ob- 
tained results are highly suggestive. The 
shape of the curves suggests, in some respects 
at least, the obverse of a learning curve. There 
is no evidence of an initial rise in frequency, 
indicating that the habit strength of the tic 
was probably at its asymptote before massed 
practice was begun. Again, the decline in fre- 
quency appears to proceed according to the 
usual learning process. There are periods of 
stability (plateaus) followed by sudden falls, 
suggesting that a negative habit was actually 
being learned. This finding agrees with the
results of Edmonson and Amsel (6), who
found that in the extinction of conditioned
avoidance responses, long periods during
which no change took place alternated with
sudden sharp declines. The curves also show
the sessional fluctuations common in learning
curves, where all the relevant factors (notably,
in this instance, drive level) cannot be
controlled.

An important problem concerns the
relationship between the anxiety and the tic. S
reported at the end of treatment that she was
much less anxious than formerly (reflected in
many particular examples of behavior she
gave). This reduction in anxiety follows
directly from the theory. If the ability to
respond is significantly reduced, then the
anxiety leading to the tics will persist.
However, since no traumatic event happens in fact
as a result of the anxiety, the latter, being
itself a conditioned response, gradually
extinguishes. This formulation suggests an
alternative method of treatment. If the tics are
prevented from occurring, then reduction of
the anxiety should follow. When this happens,
the tics will also decline, since they are a
response to anxiety. Meige and Feindel (19)
reported cases in which paralysis of facial tics
for several days led to their disappearance for
a period of several months. This hypothesis
also receives support from the findings of
Solomon and his colleagues (23) that
conditioned avoidance responses can be extinguished
in dogs by presenting the CS a large number
of times but preventing the avoidance
response by restraint. The usual extinction
procedures, which formerly had been
unsuccessful, could then be applied. These
arguments are the principal justification for
symptomatic treatment. It is interesting to note
that they follow directly from Mowrer’s
two-factor theory. Mowrer himself has strongly
opposed symptomatic treatment (21, p. 620)
although making use of it in the treatment of
enuresis. In terms of the theory, it would also
be predicted that this method of treatment
would be most successful with children
because sH_R would presumably be at less than
asymptotic strength. Thus, as soon as
negative practice began, most of the repetitions of
the tic would be contributing towards the
growth of sI_R rather than sH_R, and the growth
of the negative habit (not-doing-the-tic) would
proceed more rapidly than the growth of the
positive habit.



SUMMARY 


A theoretical model was proposed to show 
that some tics may be conceptualized as 
drive-reducing conditioned avoidance re- 
sponses, originally evoked in a traumatic 
situation. 

From this model, a method of treatment 
was derived. It was predicted that if the tics 
were evoked voluntarily under conditions of 
massed practice, a negative habit of “not do- 
ing the tic” should be built up, resulting 
ultimately in the extinction of the tics and 
that this extinction should generalize beyond 
the test situation. 

The results of a number of experiments 
support the validity of the theory. There was 
a significant decline in the ability to respond 
voluntarily under various conditions of massed 
practice and rest. 

The optimum condition for the growth of
the negative habit appeared to be the
combination of very prolonged massed practice
followed by prolonged rest.

Subjective reports of the patient indicated
considerable clinical improvement outside the
immediate test situation.



GSR CONDITIONING IN ANXIETY STATES, NORMALS, AND CHRONIC
FUNCTIONAL SCHIZOPHRENIC SUBJECTS¹


EDMUND S. HOWE²
University College, London 


In recent years, increasing attention has
been given to experimental studies of
anxiety and conditioning and to
interpathological explorations of conditioning.
There is now general agreement that anxiety,
assessed either psychiatrically or
psychometrically, plays a part in human eyelid (22,
23, 24, 25) and GSR conditioning (2, 26, 27).
Several studies have now confirmed that the
neurotic S conditions more readily in a threat
situation than does the normal (23, 25), a
difference which may be explained by an
increase in motivation derived from anxiety.
The data on the conditionability of psychotics
are, on the other hand, rather scanty and
unsystematic (1, 15, 19, 21). Two recent studies
(23, 25) have reported data and deductions
for psychotics with the conclusion that these
Ss condition more readily than both neurotic
and normal Ss. This conclusion is not
consistent with the findings of Paintal (18), for
example, who examined GSR to electric shock
and to threat of shock in 450 normal and 450
psychotic Ss. Although there was little
difference between the Ss’ response to the shock,
there was a significantly smaller response in
psychotic Ss to the threat of shock. The present
study examined conditioned GSR behavior in
hospitalized anxiety states, normals, and
hospitalized chronic functional schizophrenics, on
the hypothesis that these three groups
involve, respectively, a descending order of total
drive strength (Hull’s D) with respect to such
a nociceptive stimulus as electric shock.³


¹ This paper contains a portion of the data from a
thesis submitted to The Department of Psychology,
University College, in candidacy for the degree of
doctor of philosophy in the University of London. The
writer is indebted especially to John W. Whitfield,
who throughout supervised the research, for his
encouragement and advice; and to Roger W. Russell
and Hans J. Eysenck, each of whom made valuable
comments at different stages of the work. Many thanks
are also due to the members of the psychiatric,
psychological, and nursing staffs at St. Ebbas Hospital,
Epsom, and The Belmont Hospital, Sutton, Surrey,
for their willing and continuous cooperation in making
it possible for the writer to execute this study.

² Now with the Department of Psychiatry,
University of Maryland School of Medicine.

The question of differential drive strengths 
in the three subject groups may, of course, be 
approached empirically by a comparison of 
conditioning measures, but this procedure is 
not necessarily the optimal one. From the 
standpoint of Hullian theory (11), provision 
is made for a multiplicative combination of 
other drives (e.g., anxiety) with the relevant 
drive energizing the habit involved. In the 
present case, D may therefore be considered a 
direct function of anxiety level. In the case of 
the anxiety states, this contention has repeat- 
edly proved fruitful (22, 23, 24, 25); i.e., there 
is a sound basis for predicting that the anxiety 
cases will show more rapid, stronger condi- 
tioning than normal controls. There is, how- 
ever, a difficulty of relating Hull’s D to the 
anxiety level of schizophrenics because, while 
such patients may well show an absence of 
manifest anxiety symptoms (leading to a
prediction of slower conditioning), it may be
reasonably contended (e.g., 23, 25) that their
withdrawal is a defense against high
(hypothetical) internal anxiety (leading to a
prediction of more rapid conditioning). Perhaps this
problem has been turned into an either/or
question, neglecting the possibility that the
degree of chronicity and deterioration in
schizophrenia at least partly determines
anxiety level and hence reactivity in a threat
situation, the relationship probably being an
inverse one.


³ A second but major purpose of the study was to
examine the effect of drive level D upon the steepness
of the gradient of stimulus generalization, a
relationship that has been explored by more recent writers
(4, 20). This is the essential reason for extinguishing
one-third of each group to a different tone. When
gradients of generalization over three test points were
examined at each extinction trial for each group,
however, only two gradients showed a significant
departure from zero slope, which could, of course, occur
by chance. Because of this result, data and discussion
on generalization have been entirely omitted from this
report. It was reasoned ex post factum that the outcome
concerning the gradients was a function mostly of Ss
having been given minimal familiarity with stimuli
to be employed during the pretesting and conditioning
phases (which would decrease responsiveness to
stimulus differences and hence spuriously increase
generalization). It was thus not possible to offer conclusive
evidence about the hypothesis set up: namely, that amount
of generalization is a function of drive strength, rather
than simply of stimulus similarity (see, for example,
11, 16, 17, 20).


It can thus be argued, as Malmo et al.
concluded, that, “the much lower reactivity 
of the chronic schizophrenic is evidence for a 
trend toward lowered reaction to pain as 
schizophrenia progresses toward chronicity” 
(14, p. 63). Malmo and his collaborators sug- 
gested, however, that this interpretation is 
probably applicable to rather specific experi- 
ences like pain, and may not be true at all 
for other situations involving overt adjust- 
ments. The characteristics of chronic schizo- 
phrenics, as presented by such writers as 
Hoagland (6, 7, 8), Hoskins (9), Hill et al. (5), 
and others, show a lowered reactivity to equi- 
librium functions, central disturbances rather 
than peripheral ones, and inadequate responses 
to stress as measured by lymphocyte counts 
and 17-ketosteroid output. It has also been 
suggested by Hoskins (9) that an inadequate 
adrenal cortical response, rather than a higher 
stress threshold, is responsible for the lowered 
reactivity of the chronic schizophrenic. The 
general picture emerging from a survey of this 
kind of work leads to the following concep- 
tualization of the chronic deteriorated schizo- 
phrenic: stress-produced responses which seem 
to be justified biologically frequently do not 
occur at all, or occur late, or occur to a di- 
minished degree. This general statement is, of 
course, quite the converse from and at the 
same time complementary to what is ex- 
pected in the anxiety neurotic, despite the fact 
that different pain thresholds have not been 
demonstrated for the different Ss. With such 
considerations in mind, it was considered most 
reasonable to hypothesize low drive strength 
with respect to electric shock in the
schizophrenic population used in this study, and to
predict weaker, slower conditioning. 
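
The prediction rests on Hull's multiplicative rule, in which reaction potential is the product of drive (D) and habit strength. The sketch below is only an illustration of that rule: the drive values and the habit strength are hypothetical numbers chosen to respect the hypothesized ordering of the three groups, not estimates from this study.

```python
def reaction_potential(drive, habit_strength):
    """Hull's multiplicative rule: reaction potential = D x habit strength."""
    return drive * habit_strength


# Hypothetical drive levels (illustrative only), ordered as hypothesized.
HYPOTHETICAL_DRIVE = {
    "anxiety states": 3.0,
    "normals": 2.0,
    "chronic schizophrenics": 1.0,
}

habit_strength = 0.5  # assumed equal across groups at a given trial
potentials = {
    group: reaction_potential(drive, habit_strength)
    for group, drive in HYPOTHETICAL_DRIVE.items()
}
ranking = sorted(potentials, key=potentials.get, reverse=True)
```

With habit strength held equal, the group with the higher assumed drive always shows the larger reaction potential, which is the sense in which stronger, faster conditioning is predicted for the anxiety states and weaker, slower conditioning for the schizophrenic group.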


METHOD 


Subjects 


A total of 60 normal control Ss was used, half drawn
from University of London students and half from the
nursing staffs of the two cooperating hospitals. Student
Ss ranged from 19 to 24 years; half of them were male,
sixty per cent taking honors in psychology and the
remainder coming from other departments. The sample
of nurses contained an equal number of males and
females, with an age range from about 18 to 25. All
control Ss were volunteers.

The anxiety Ss were in-patients drawn from Belmont 
Hospital, Surrey, a National Rehabilitation Center. A 
total of 60 Ss, psychiatrically diagnosed as acute or 
chronic cases of severe anxiety, were distributed as 
follows: anxiety reactions, 50; phobics, 5; compulsives, 
3; hysterics, 2. There were 29 males and 31 females. All 
Ss reported upon cooperated voluntarily. The ages of 
these patients were not recorded, but the mean was 
about 30 with a range of approximately ten years. 

The schizophrenic group was drawn from voluntary
in-patients at St. Ebbas Hospital, Surrey, and
was selected on the basis of psychiatric diagnosis. A
criterion of chronicity was set at two years or more of
illness; but in fact the mean duration of illness was
over five years. After elimination of unusable data
(due, for example, to some Ss’ quitting midway through
the conditioning procedure), 60 Ss psychiatrically
diagnosed as chronic, functional schizophrenic were
distributed as follows: simple schizophrenia, 3;
paranoid schizophrenia, 4; catatonia, 10; hebephrenia, 1;
and “mixed schizophrenic symptoms,” 42. There were
26 males and 34 females, with a mean age of 32 years.
Patients were assigned for conditioning by the
psychiatric staff. Any S who attempted to leave the
experimental room was permitted to do so.

A further group of 36 schizophrenic Ss proved to 
be unconditionable according to one of three criteria: 
no GSR whatsoever to twice the standard intensity of 
shock usually delivered to other Ss during pretesting; 
complete sensory adaptation to shock (i.e., no GSR) 
either before or at the last conditioning trial; or zero
GSR to the first extinction trial. A comparison of 
these Ss with those who were conditioned is presented 
in the results. 
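
The three exclusion criteria can be sketched as a single predicate. The function and argument names are hypothetical, and treating "complete sensory adaptation" as any zero GSR to shock during the conditioning trials is one reading of the text, not a documented scoring rule.

```python
def is_unconditionable(pretest_gsr_at_double_shock,
                       shock_gsrs_during_conditioning,
                       first_extinction_gsr):
    """Return True if an S meets any of the three exclusion criteria."""
    # Criterion 1: no GSR at all to twice the standard shock in pretesting.
    no_response_to_strong_shock = pretest_gsr_at_double_shock == 0
    # Criterion 2: GSR to shock falls to zero before or at the last trial.
    adapted_to_shock = any(g == 0 for g in shock_gsrs_during_conditioning)
    # Criterion 3: zero GSR on the first extinction trial.
    no_conditioned_response = first_extinction_gsr == 0
    return (no_response_to_strong_shock
            or adapted_to_shock
            or no_conditioned_response)


# Hypothetical galvanometer deflections in mm (illustrative only).
excluded = is_unconditionable(4.0, [3.1, 1.2, 0.0], 0.8)
retained = is_unconditionable(4.0, [3.1, 2.4, 1.9], 0.8)
```

In this sketch an S is excluded as soon as any one criterion is met, which matches the phrase "according to one of three criteria."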


Apparatus and Method of Recording GSRs 


The Féré method, involving impressed current, was 
used in the circuit described by Lacey and Siegel (13).
GSRs were read from a Pye “Scalamp” galvanometer,
the period of which is two seconds, and which can be
read quite accurately to 0.1 mm. In order to determine
S’s resistance, it was simply necessary to adjust R2
until approximately 60 microamperes were passing
through him. Since the potential drop across S was
always constant, basal resistance was obtainable from
a specially calibrated dial, superimposed upon R2. The
ohmic magnitude of any single GSR was thus directly
proportional to the magnitude of the spot deflection.
The GSR electrodes consisted of two highly polished 
brass discs, one inch in diameter, each firmly embedded 
in the base of a vacuum tube ground down to a depth 
of about one-half inch. Felt plugs, soaked in two per 
cent saline before use with each S, were wedged in the 
cups on top of the electrodes. The electrodes were 
firmly attached with elastic bandages to the third 
phalanges of the second and third fingers of each hand. 
It was evident from preliminary explorations that in 
this position, rather than in the center of the palms, 
the contact pressure (and hence the amount of current 
permitted to flow) was relatively constant. 

Electric shocks were delivered from a coil built into
an electronic timing mechanism. The intensity was
varied with an ordinary tuning dial graduated in 
arbitrary units. The shock was of 75 msec. duration 
and was delivered across the S’s wrists through two 
polished brass electrodes. 

Conditioned and other tonal stimuli were produced
by a type H.1. Advance audiofrequency generator,
delivered to S through an Erikson low impedance
earphone to the preferred ear, the headset being
balanced by a nonfunctional earpiece. The frequencies
employed were 1967, 1000 and 468 c.p.s. Since the
work of Hovland (10) has provoked considerable
discussion, three of his frequencies were employed, but
no check was made upon the accuracy of his psychophysical
measurements. All tonal stimuli were presented
at an intensity of approximately 40 decibels, but were
equated for loudness in preliminary studies.

A specially built electronic device served to control
the presentation of all stimuli at the touch of a post
office relay switch. Tonal stimuli lasted for exactly 400
msec., followed during conditioning by an interval of
100 msec., after which the shock of 75 msec. duration
was presented.


Pretesting Procedure 


In all three groups, S was seated in one cubicle, 
while E, after installing and instructing him, controlled
the whole conditioning procedure remotely from a 
separate cubicle. Each S was seated in a reclining chair 
with a footrest. He was first asked to relax for about 
three to five minutes during which period the basal 
resistance level typically assumed some steady point.
The resistor Ry, in the galvanic circuit was then ad- 
justed until a predetermined current of 59.3 microamps 
was passing through him, thus permitting measurement 
of basal resistance level to the nearest 1000 ohms. This 
initial adjustment, of course, placed a certain potential 
across S, which remained constant for him during the 
entire experimental session.

Pretesting of stimuli involved presentation of the 
1967 c.p.s. tone once per minute for three minutes, the 
GSR being observed on each occasion. It was evident 
in the vast majority of cases that adaptation was then 
complete, since, at the fourth pretesting presentation 
of the tone, the response was either zero or negligible.
No pretests of responsiveness to the other two tones
were made, it being assumed that generalization of
adaptation would be considerable if not complete.
Finally, each S was given three shocks of increasing
intensity and was asked on each occasion for a verbal
report of how each felt (unpleasant but not painful,
painful, etc.). Successful attempts were made to deliver
the same mean intensity of shock to each group of Ss.


Conditioning Procedure 


After a pause of five minutes, each S was given eight 
reinforced trials of the 1967 c.p.s. tone (CS) followed 
by the shock (UCS) at a staggered rate of one per min- 
ute. For each stimulation, minimal and maximal 
galvanometric readings were taken. One minute after 
the eighth reinforcement trial, each S was subjected 
to experimental extinction over eight trials. For the 
latter purpose, one-third of each group (i.e., normal
control, anxiety, and schizophrenic groups) was presented
with the 1967 c.p.s. tone (0), another third
with the 1000 c.p.s. tone (1), and the remainder with
the 468 c.p.s. tone (2). Initial and maximal galvanometric
deflections were noted through the eighth
extinction trial.


Treatment of Data 


Haggard (3) demonstrated that logarithm of conductance
change is a favorable transformation of GSR
data, since it has been found to satisfy the requirements
of normality and independence of response magnitude
from basal resistance level. Initial examination of
data at the first acquisition trial indicated that this
measure was superior to both resistance change and
conductance change according to these criteria.
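
The transformation named above can be illustrated with a minimal sketch. The function name, the use of micromhos, base-10 logarithms, and the additive constant `k` (Table 2 mentions "+ K" without giving its value) are assumptions made for illustration, not details taken from the report.

```python
import math


def log_conductance_change(r_before_ohms: float, r_after_ohms: float, k: float = 0.0) -> float:
    """Log10 of the rise in skin conductance (micromhos) for one GSR, plus a constant k.

    r_before_ohms: resistance just before the response (ohms)
    r_after_ohms:  lowest resistance reached during the response (ohms)
    k:             hypothetical additive constant used to keep scores positive
    """
    c_before = 1e6 / r_before_ohms  # conductance in micromhos
    c_after = 1e6 / r_after_ohms
    delta = c_after - c_before
    if delta <= 0:
        raise ValueError("no conductance increase; the log transform is undefined")
    return math.log10(delta) + k


# Hypothetical reading: resistance falls from 50,000 to 48,000 ohms.
score = log_conductance_change(50_000, 48_000)
print(round(score, 4))
```

A zero response has no logarithm, which is presumably one reason a constant is added before tabulation; the sketch simply raises an error in that case.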


RESULTS 


As a matter of course, the basal resistance 
levels at the beginning of the experiments 
were compared on an interpathological basis. 
Mean values in thousands of ohms and varia- 
bilities are shown in Table 1. Using separate 
estimates of the population variance for each 
group of observations, the appropriate t values
showed that the three means differed from 
each other at less than the .001 level of sig- 
nificance. Further, the variance of the anxiety 
group was smaller than that of the normal 
group (F = 1.58, p < .05), while that of the 
schizophrenic group was considerably larger 
than that of the normal group (F = 3.56, p 
< .001). These figures point up the relatively 
high variability of schizophrenic patients and, 
indirectly, the instability of psycho-physio- 
logical systems established in such Ss and 
reported by others (for example, 9).

A chi-square test of normality of response 
magnitudes at the first acquisition trial showed 
that with log conductance transformations the 
anxiety and normal groups could be con- 
sidered as showing only random departures 
from normality (with 5 df, χ² = 8.21 and p
= .15 for the anxiety group; with 4 df, χ² =
6.77 and p = .15 for the normal group), while
with 5 df, the schizophrenic group showed a
χ² of 12.5. The latter figure would be expected
to occur only 3 per cent of the time in repeated
sampling and hence indicates a significant departure
from normality. The first acquisition
trial data were also subjected to the L1 test
for equality of variance described by Johnson
(12), and a value of 0.978 was obtained.
Entering Nayer’s Table with k = 3 and n =
60, it is found that a value of L1 equal to or
less than 0.967 justifies rejection of the hypothesis
of equal variance (p = .05). The
three groups studied may thus be considered
to have initially equal variance.


TABLE 1
BASAL RESISTANCE LEVELS BEFORE CONDITIONING
(in Ohms × 10³)

                      Group
    Anxiety        Normal         Schizophrenic
    21.6           [illegible]    53.0
    3.             5.             20.3
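
The tail probabilities quoted for the normality tests can be checked with a short sketch. The function `chi2_sf` is a hypothetical helper written for this illustration; it uses the standard closed-form recurrence for integer degrees of freedom rather than any procedure described in the report.

```python
import math


def chi2_sf(x: float, df: int) -> float:
    """Upper-tail probability P(X >= x) for a chi-square variate with integer df >= 1."""
    if df < 1:
        raise ValueError("df must be a positive integer")
    if x <= 0:
        return 1.0
    half = x / 2.0
    # Starting values: df = 2 for even df, df = 1 for odd df.
    if df % 2 == 0:
        prob, nu = math.exp(-half), 2
    else:
        prob, nu = math.erfc(math.sqrt(half)), 1
    # Recurrence: Q(nu + 2) = Q(nu) + (x/2)^(nu/2) * exp(-x/2) / Gamma(nu/2 + 1)
    while nu < df:
        prob += half ** (nu / 2.0) * math.exp(-half) / math.gamma(nu / 2.0 + 1.0)
        nu += 2
    return prob


for label, value, df in [("anxiety", 8.21, 5), ("normal", 6.77, 4), ("schizophrenic", 12.5, 5)]:
    print(label, round(chi2_sf(value, df), 3))
```

Under these assumptions the first two values come out near .15 and the third near .03, in line with the figures given in the text.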

Product moment correlations between response
magnitude and initial level were computed
for each group at the first acquisition
trial. Values of r were from .120 to -.239.
None of these values is significant at the five
per cent level. Thus, while there are considerable
differences among the groups studied,
response magnitude may be regarded as independent
of the initial level from which it commenced.
A final check on the approximate
comparability of the anxiety, normal, and
schizophrenic groups was made via analysis
of variance of response magnitudes at the
first acquisition trial. Since the value of F
was but 0.14, it may confidently be concluded
that all groups were responding initially with
comparable magnitude.
Consequently, rates of extinction were
studied via a two-way classification analysis
of variance for repeated measurements. Table
2 shows mean response magnitude for each
group by extinction trials. Each mean value
was derived from the responses of the 60 Ss
in the group, each third of whom was partly
extinguished to one of the three tones employed
in the study. In other words, each
mean value is actually a regression constant,
a. It will be observed from Table 2 that there
are fairly consistent trends showing the anxiety
group to be responding with the greatest,
the schizophrenic group with the smallest, and
the normal group with intermediate magnitude.
The analysis is shown in Table 3. The
between groups (i.e., anxiety, normal, and
schizophrenic) F ratio of 8.16 (p < .001) was
obtained using the variance between Ss in
the same group as the error term. The between
extinction trials F ratio of 235.21 (p < .001)
and the Groups × Trials interaction F ratio
of 7.98 (p < .001) were obtained using as
error term the interaction for Pooled Ss ×
Trials. These results indicate at highly acceptable
levels of significance that by the
eighth extinction trial all groups had shown a
decrease in response magnitude. Further, the
magnitude of response was consistently greatest
in the anxiety group and smallest in the
schizophrenic group, and the absolute intergroup
differences in response magnitude were
clearly an increasing function of the number
of extinction trials. The data are thus in accordance
with the original assertion that the
anxiety group has the higher and the schizophrenic
group the lower total drive level
(Hull’s D). They are in general accordance
also with the relevant aspects of recent studies
of anxiety and the extinction of conditioned
responses (2, 22).


TABLE 2
GSR MAGNITUDES* DURING EXTINCTION TRIALS
(Log. conductance change + K)

Extinction                        Group
Trial Number      Anxiety      Normal         Schizophrenic
                  1.74         [illegible]    1.64
                  1.58         1.5            1.41
                  1.31         38             1.20
                  1.28         [illegible]    1.00
                  1.21         [illegible]    0.75
                  0.81         5              0.09

* Each value in the table is a rounded mean of 60 Ss.
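
The three F ratios reported above follow directly from the mean squares of Table 3. The sketch below is only an arithmetic check: the pairing of each mean square with its source line, and the design of 3 groups, 60 Ss per group and 6 analysed trials used for the degrees of freedom, are assumptions inferred from the ratios and from the total of 1079 df.

```python
# Mean squares as read from Table 3 (assignment to sources is an assumption).
mean_squares = {
    "between_groups": 8.770,
    "ss_within_groups": 1.074,
    "between_trials": 39.752,
    "trials_x_groups": 1.349,
    "pooled_ss_x_trials": 0.169,
}

# Between-subjects effect is tested against Ss within groups;
# within-subjects effects are tested against the pooled Ss x Trials interaction.
f_groups = mean_squares["between_groups"] / mean_squares["ss_within_groups"]
f_trials = mean_squares["between_trials"] / mean_squares["pooled_ss_x_trials"]
f_interaction = mean_squares["trials_x_groups"] / mean_squares["pooled_ss_x_trials"]

# Degrees of freedom implied by the assumed design.
groups, ss_per_group, trials = 3, 60, 6
df = {
    "between_groups": groups - 1,
    "ss_within_groups": groups * (ss_per_group - 1),
    "between_trials": trials - 1,
    "trials_x_groups": (trials - 1) * (groups - 1),
    "pooled_ss_x_trials": groups * (ss_per_group - 1) * (trials - 1),
}

print(round(f_groups, 2), round(f_trials, 2), round(f_interaction, 2))
print(df, sum(df.values()))
```

The ratios agree with the reported values to within rounding, and the component degrees of freedom sum to the 1079 shown for the total.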

TABLE 3
ANALYSIS OF VARIANCE OF EXTINCTION DATA

Source of Variation                    df       Mean Square
Between groups                                  8.770
Between Ss in same group                        1.074
Between trials                                  39.752
Interaction: Trials × Groups           10       1.349
Interaction: Pooled Ss × Trials                 0.169
Total                                  1079


Since heterogeneity of variance among extinction
data was not, however, precluded by
application of Bartlett’s test, thus leaving the
acceptability of these results somewhat equivocal,
a further test was performed with a nonparametric
method devised by Wilson (28).
This test has the advantage of being able to
explore interaction as well as main effects
without any supposition concerning the variability
of data. The median for the entire set
of data was computed and a 2 × r × c contingency
table (see Table 4) was set up, where
r = 6 (Trials) and c = 3 (Groups), and the
third factor corresponds to an exact division
of the data into two sets, above and below the
median. A total value of chi square was computed
which was then broken down into main
and interaction effects. The resultant analysis
(Table 5) yielded a significant between groups
value of 12.078 (with 2 df, p < .01), a significant
between trials value of 289.96 (with 5
df, p < .001), and a significant interaction
value of 29.018 (with 10 df, p < .005). While
these results are statistically in general accordance
with those obtained from the parametric
method, there is one important consideration
which leaves interpretation yet equivocal.
Reference to Table 4 shows that when contingency
data are used as an indirect measure
of response strength during extinction, the
normal and schizophrenic populations show
virtually zero differences in relation to each
other. The significant between groups and
interaction values of chi square are thus probably
due largely to the predicted deviation of
the anxiety group from the other two. On the
one hand, such an outcome is possible because
of the additive properties of chi square; and
on the other, it should be noted that considering
the behavior measure employed in this
study, resort to a nonparametric method potentially
involves the loss of considerable
information (cf. the relatively consistent
smaller mean response magnitude in the
schizophrenic group in Table 2).


TABLE 4
CONTINGENCY TABLE FOR EXTINCTION DATA*

                        Group
    Anxiety      Normal      Schizophrenic
    7            9           [illegible]        23
    13           14          [illegible]        49
    30           20          [illegible]        75
    30           39          [illegible]        101
    32           54          [illegible]        132
    41           59          60                 160

    153          195         192                540 = m

* Subscripts a and b indicate that the referent lies above or
below the median, and %, = m, and all ;,; are equal


TABLE 5
SUMMARY OF NONPARAMETRIC ANALYSIS
OF EXTINCTION DATA

Source                    χ²          df
Columns (Groups)          12.078      2
Rows (Trials)             289.960     5
Interaction G × T         29.018      10

Total                     331.056     17


Some further light was thrown upon the
dubious normal-schizophrenic differences by a
comparison of the 60 psychotic Ss with 36 who
were not conditionable because of zero GSR
to shock, complete GSR adaptation to shock
during the acquisition trials, or failure to give
any response to the first and subsequent extinction
trials. The unconditionable group had
been given stronger shock (p < .01), showed
a much higher resistance level (p < .01), were
an average of 4.5 years older (p = .05), and
had been ill for an average of seven years
longer (p < .01). While certainly suggesting
that the schizophrenic population was not
randomly sampled, these facts enhance the
significance of other results reported here.
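
The median-split step behind the nonparametric check reported in Tables 4 and 5 can be sketched as follows. This is only an illustration of how a 2 × r × c table of counts might be built: the function name, the toy data, and the rule that values equal to the median count as "below" are assumptions, and the partition of chi square into main and interaction effects is not reproduced.

```python
from statistics import median


def median_split_counts(responses):
    """Build (above, below) counts per group and trial around the grand median.

    responses maps a group name to a list of trials; each trial is a list of
    response magnitudes, one per S. Ties with the median are counted as below.
    """
    pooled = [v for trials in responses.values() for values in trials for v in values]
    cut = median(pooled)
    table = {
        group: [
            (sum(v > cut for v in values), sum(v <= cut for v in values))
            for values in trials
        ]
        for group, trials in responses.items()
    }
    return cut, table


# Hypothetical data: two groups, two trials, two Ss per cell.
toy = {
    "anxiety": [[3.0, 4.0], [2.0, 3.5]],
    "schizophrenic": [[1.0, 2.5], [0.5, 1.5]],
}
cut, table = median_split_counts(toy)
print(cut, table)
```

With an even number of distinct observations the split is exact, so half of all responses fall in each of the two sets, as the text requires.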


DISCUSSION 


Theoretical considerations led at the start 
of this work to a prediction of strongest con- 
ditioning in the anxiety group and weakest 
conditioning in the schizophrenic group. 
While the anxiety Ss clearly showed signifi- 
cantly greater response magnitude during 
extinction compared with all other Ss, it was 
not possible to demonstrate conclusively in 
any rigorous, formal fashion that the chronic 
schizophrenic Ss showed significantly smaller 
response magnitudes than the normal Ss. But 
there can be no doubt whatsoever that a sig- 
nificant degree of extinction occurred by the 
eighth extinction trial, that a significant inter- 
action exists between groups and trials, and 
that the schizophrenic Ss show a significantly 
lower response magnitude than the anxiety Ss. 
These general conclusions are thus in accord- 
ance with recent studies of questionnaire- 
selected high and low anxious Ss (e.g., 2, 22, 
24). 

At the same time, other recent predictions 
and conclusions (23, 25), such as the conten- 
tion that the psychotic conditions autonomi- 
cally more readily than both the normal and 
the neurotic, are contradicted by the results 
presented here. The prediction made for the 
behavior of a schizophrenic in a nociceptive 
conditioning situation will, however, more
likely be a function of the degree of chronicity 
and perhaps the deteriorative character of the 
psychosis. Some of the psychotics studied by 
Spence and Taylor (23, 25) were outpatients; 
those here reported upon were all chronic, 
hospitalized cases, which may partly account 
for the obtained differences. But the compari- 
son reported above of conditionable and un- 
conditionable schizophrenic Ss, showing a 
greater chronicity in the latter group, most 
strongly suggests a relationship between 
chronicity and autonomic conditionability. 

In the light of the foregoing considerations, 
it is very difficult to make an overall state- 
ment about anxiety, psychosis, and Hull’s D, 
unless the relative strengths of D can be 
roughly specified at different stages of the dis- 
order. In other words, some psychotics may 
condition more readily than normals and even 
than anxious Ss; but, a priori, the converse is 
just as likely, depending upon the various 
qualifications raised here.


SUMMARY 


Conditioning of the GSR to electric shock 
in 60 hospitalized anxiety states, 60 normal 
controls, and 60 chronic functional schizo- 
phrenics yielded the following conclusions: 

1. With magnitude of response (in log con- 
ductance units) during experimental extinc- 
tion as an indirect measure of strength of con- 
ditioning, the anxiety Ss showed significantly 
stronger conditioning than both the normal
and schizophrenic Ss, the difference in re-
sponse magnitude being some increasing func- 
tion of the number of extinction trials, as 
shown by a significant interaction term. While 
schizophrenic Ss showed the lowest magnitude 
of response, it was not possible to offer proof 
with either a parametric or a nonparametric 
method of analysis that this magnitude was 
statistically less than that for the normal Ss. 

2. It was argued that predictions concern- 
ing the autonomic conditionability of the 
schizophrenic must vary with the degree of 
chronicity of and perhaps the amount of de- 
terioration in the disorder, with less reactivity 
(for example, Hull’s drive strength, D) being 
more typical of the chronic and deteriorated 
case. A comparison of schizophrenic Ss who 
were conditionable with some who were not 
gave support to such an hypothesis. 
PARENTAL EVALUATIONS OF CHILDREN AND CHILDREN’S
SELF-EVALUATIONS 


MALCOLM M. HELPER 


Nebraska Psychiatric Institute, University of Nebraska College of Medicine 


SELF-EVALUATIVE attitudes are widely believed
to arise rather directly from
evaluations made of the individual by
others, with especial influence frequently being
attributed to evaluations made by parents 
(1, 9). In spite of this apparent theoretical 
agreement, or perhaps because of it, very little 
empirical study has been made of the relation- 
ships between parental evaluations of the child 
and the child’s self-evaluations. Jourard and 
Remy (6) have made what appears to be the 
only study having direct bearing on this 
question. They studied relationships between 
college students’ satisfaction with self and body 
(self- and body-cathexis) and parental evalua- 
tions of the students, the latter being reported 
by the students themselves. The correlations 
ranged from .56 to .70 between self-satisfaction 
expressed by the students and parental satis- 
faction with the students as reported by the 
students. These authors note, however, that 
parental evaluations should be assessed directly 
before definite conclusions are drawn, because 
reports by the students as to their parents’ 
attitudes may be subject to systematic dis- 
tortion. 

In the present study, independent assess- 
ment was made of self-evaluation in a group of 
8th and 9th grade children and of evaluation 
of the children by their parents. 


METHOD 


Forty-two adjectival rating items were developed 
on the basis of dimensions of personality description 
found by Cattell to be representative of a very large 
number of trait names (2). Four scales were also selected
from Osgood’s analysis of connotative meaning
(7). It was believed that these bases for selection would
insure a wide sampling of behavior descriptions and
avoid undue emphasis on any one aspect of personality
description. Each scale was defined at the two extremes
by adjectives or adjectival phrases and offered a seven-point
scale for rating, as in the example below.
| | a a a a _: quiet 
Subjects were asked to rate several concepts on each 
of the forty-six scales by checking the space that best 
described that concept as they viewed it. Each child 
was asked to rate himself as he ordinarily thought of 
himself (actual self-concept), and to rate himself as he 
would most like to be (ideal self-concept). Each parent
used the scale to rate his child as he ordinarily thought 
of the child (actual child-concept) and to rate his child 
as he (the parent) most wanted the child to be (ideal 
child-concept). Other ratings were collected at the same 
time, and were utilized in investigating other aspects of 
children’s self-description (5). 


Subjects 


Data were gathered on the 74 children in the 8th 
and 9th grades in the secondary school operated by the 
University of Illinois. One or both of the parents of 53 
of the children cooperated in providing the ratings described
above. Because of incomplete ratings on the part
of a few Ss, the number of families included in different
computations varies from 50 to 53. Each computation
included all of the families providing pertinent data.
Since over one-half of the fathers involved were employed
in an academic capacity by a university, the
families must be considered to be primarily representative
of a narrow portion of the socioeconomic spectrum.


Measures of Evaluative Attitudes 


Two methods of measuring attitudes toward (or
“acceptance” of) the self and others find current usage.
In one method, items are selected that unequivocally
indicate a favorable or unfavorable attitude toward the
subject in question (3, 8). Ratings on these items can
be summed into a single score since the favorability
indicated by a given response has been determined in
advance. This method has the advantage of simplicity
of scoring but is limited to items that have unambiguous
evaluational implications. In the second and more popular
method, “actual” ratings on a group of items are
subtracted from “ideal” ratings on the same group of
items. The differences are summed across items to yield
a score that reflects the extent of the discrepancy between
the actual and ideal concepts. This method (of
which the Q sort is a variant) allows for differences
among Ss in evaluating the desirability of a given trait.
However, the difference scores are necessarily less reliable
than the raw ratings from which they are
derived (4).

In the present study, both types of measures were
applied to children’s self-ratings and to the parents’
ratings of their children. A Favorability score was
computed by summing “actual” ratings on 15 items on
which there was very high agreement among raters as
to desirability. These were items on which the ideal
self-ratings were heavily concentrated toward one end
of the rating scale. All items having a mean ideal self-rating
more than 1.5 scale points from the midpoint of
the 7-point scale were identified. Of these, the fifteen
items having the smallest standard deviation in ideal
self-ratings were selected for the Favorability scale.
In practical terms, the distribution of ideal ratings on
these items is so narrow that subtracting the actual
ratings from them would add little information. The
Favorability score was simply the sum of the raw
ratings on these fifteen items after all items had been
aligned so that a high rating indicated a desirable
standing. A Child-Favorability score was obtained from
each parent’s actual child ratings, while a Self-Favorability
score was obtained from each child’s actual
self-ratings.
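
The selection and scoring rules for the Favorability scale can be sketched in a few lines. All names and the toy ratings below are hypothetical; in particular, reflecting an item as `8 - rating` when its desirable pole is the low end is an assumed reading of "aligned so that a high rating indicated a desirable standing".

```python
from statistics import mean, stdev

MIDPOINT = 4  # centre of the 7-point scale


def select_favorability_items(ideal_ratings, min_distance=1.5, n_items=15):
    """Pick items whose mean ideal rating lies far from the midpoint, lowest SD first.

    ideal_ratings maps item name -> list of ideal self-ratings (1..7), one per child.
    """
    eligible = [
        item
        for item, ratings in ideal_ratings.items()
        if abs(mean(ratings) - MIDPOINT) > min_distance
    ]
    eligible.sort(key=lambda item: stdev(ideal_ratings[item]))
    return eligible[:n_items]


def favorability_score(actual, ideal_ratings, items):
    """Sum one rater's actual ratings, reflecting items whose desirable pole is low."""
    total = 0
    for item in items:
        desirable_high = mean(ideal_ratings[item]) > MIDPOINT
        total += actual[item] if desirable_high else 8 - actual[item]
    return total


# Hypothetical ideal ratings from four children on three items.
ideal = {"a": [7, 7, 6, 7], "b": [4, 5, 3, 4], "c": [1, 3, 1, 2]}
chosen = select_favorability_items(ideal, n_items=2)
print(chosen, favorability_score({"a": 6, "b": 4, "c": 2}, ideal, chosen))
```

The same scoring function would serve for a child's actual self-ratings (Self-Favorability) and for a parent's actual ratings of the child (Child-Favorability).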

A Self-Acceptance score was derived for each child
from the actual-ideal discrepancies generated by the
remaining 31 items. For each child the 31 discrepancy
scores were squared and then summed in accordance
with the specifications for D, the statistic of profile
comparison (4). The discrepancies on these items between
each parent’s actual-child and ideal-child ratings were
treated in like fashion to yield a Child-Acceptance score.
In this method of scoring, the higher the score, the
less the Acceptance. Neither the Self-Acceptance nor
the Child-Acceptance scores were distributed normally.
For this reason, nonparametric statistical procedures
were used in all analyses.
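
The Acceptance scoring described above amounts to a sum of squared actual-ideal discrepancies. The sketch below uses a hypothetical function name and toy ratings; it returns the sum of squares itself, whereas taking the square root would give a distance. Because the analyses are rank-based, either form orders the Ss identically.

```python
def acceptance_score(actual, ideal):
    """Sum of squared actual-ideal discrepancies; a higher score means less acceptance."""
    if len(actual) != len(ideal):
        raise ValueError("actual and ideal ratings must cover the same items")
    return sum((a - i) ** 2 for a, i in zip(actual, ideal))


# Hypothetical ratings on three items (the study used 31).
print(acceptance_score([4, 2, 6], [7, 2, 5]))
```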

An estimate of each child’s acceptability to his peers
was available from a sociometric study made by the
two classroom teachers involved. Each child had been
asked to list classmates with whom he would like to
work on a study project. The number of choices received
by each child was taken as the index.

The data described above permit study of several
questions relating to the problem of self-evaluation.
For methodological purposes, the degree of correspondence
between the two approaches to the assessment of
self-evaluation was studied. The question of primary
interest, the degree of correlation between parental
evaluations of children and children’s self-evaluations,
was investigated in terms of both the Favorability and
the Acceptance variables. The relationship between
children’s self-evaluations and acceptability to peers
was also considered.

RESULTS 


Reliability coefficients were computed by
the split-half method for Favorability and for
Acceptance in both parents and children.
Rank-difference correlation was used because
of the non-normality of the Acceptance scores.
These coefficients, after correction for total
length by the Spearman-Brown formula, are
presented in Table 1. The reliabilities are seen
to be only moderate, but do appear high
enough to justify further study of the measures.
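
The Spearman-Brown correction mentioned above steps a half-test correlation up to the reliability of the full-length scale. The sketch below shows the standard formula for doubled length and its inverse; the function names are hypothetical, and the half-test value is a made-up example rather than a figure from the study.

```python
def spearman_brown(r_half: float) -> float:
    """Estimated full-length reliability from the correlation between two halves."""
    return 2.0 * r_half / (1.0 + r_half)


def half_from_full(r_full: float) -> float:
    """Inverse of the correction: the half-test correlation implied by r_full."""
    return r_full / (2.0 - r_full)


print(spearman_brown(0.60))
print(round(half_from_full(0.86), 3))
```

The inverse is useful for reading Table 1: a corrected coefficient of .86, for instance, implies an uncorrected split-half rho of roughly .75.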

The correlations between the two measures
of attitudes toward the child, shown in Table
2, are much lower than their respective reliabilities.
The two measures do not appear to tap
the same aspects of evaluative attitudes
toward the child, either in the parents or
within the child himself.
TABLE 1
RELIABILITY OF FAVORABILITY AND ACCEPTANCE SCORES*

Group        N      Favorability    Acceptance
Boys         34     .86             [illegible]
Girls        40     .77             .80
Fathers      53     .66             .80
Mothers      51     .85             .83

* Coefficients are rhos corrected by the Spearman-Brown
formula for total length of the scales.


TABLE 2
INTERCORRELATIONS OF FAVORABILITY AND
ACCEPTANCE MEASURES

Group              N      Rho
Boys               34     .24
Girls              40     .21
Fathers
  Of Boys          20     .10
  Of Girls         30     .18
  All Fathers      50     .16
Mothers
  Of Boys          21     .42
  Of Girls         30     .23
  All Mothers      51     [illegible]*

* Significant at the .01 level.


Rank difference correlations between parental
Favorability and Acceptance scores and
the corresponding self-evaluative scores in
children are presented in Table 3. In general,
the parent-child correlations are low for both
the Favorability and the Acceptance variables.
Of the 24 correlations computed, 5 reach significance
at the .05 level, 4 fall between the
.05 and .10 levels, and 15 do not approach
statistical significance. All except one of the
correlations fell in the predicted positive
direction. (It is not claimed, of course, that the
24 correlations are entirely independent.) On
the whole, then, the data do seem to point to
the existence of a slight but real tendency
toward similarity between parents’ evaluations
of their children and the children’s self-evaluations.

While the correlations do not differ enough
among themselves to warrant differential interpretations
of the various parent-child combinations,
it can be noted that all but one of
the significant and near-significant correlations
are between corresponding variables at the
parent and child levels. That is, parental
Favorability has significant relationships only
with Self-Favorability in the child, while
parental Acceptance is related much more
consistently to children’s Self-Acceptance than
to children’s Self-Favorability.


TABLE 3
PARENT-CHILD CORRELATIONS (RHOS) FOR TWO MEASURES OF EVALUATION OF THE CHILD

                         Boys                          Girls                            All Ss
                  N    Favor-    Accept-        Favor-        Accept-           Favor-        Accept-
                       ability   ance           ability       ance              ability       ance
Fathers
  Favorability    20   .44*      .26            .26           .06               [illegible]   [illegible]
  Acceptance      20   .18       .42*           .28           .44**             [illegible]   [illegible]
Mothers
  Favorability    21   .20       .10            .18           [illegible]       [illegible]   .22
  Acceptance      21   .29       -.08           [illegible]   [illegible]       [illegible]   .16

* Coefficient reaches .10 level, two-tailed test.
** Coefficient reaches .05 level, two-tailed test.


The small size of the parent-child correlations
does not rule out the possibility of more
substantial relationships in certain subgroups.
In view of the low correlations between
parental Acceptance and Favorability, it would
seem reasonable to study the possibility that
different combinations of the two variables
may be associated with differences in children’s
self-evaluations. One could reason that children
whose parents view them unfavorably
would be rather highly sensitive to their
parents’ acceptance of them and their faults,
even though the children’s Self-Favorability
does not reflect the parents’ basically negative
view of them to any great extent. On the other
hand, children whose parents view them as
basically sound might well be less sensitive to
any unusually high aspirations (high actual-ideal
discrepancy) that the parents might
express for them. In terms of the present
variables, there should be a relatively high
relationship between parental Acceptance and
children’s Self-Acceptance (or Self-Favorability)
in cases where parents have low Favorability
scores, but not in cases where parents
have high Favorability scores.

In accordance with this hypothesis, the relationship
between parental Child-Acceptance
and the two self-evaluative variables was
studied separately in the group of families
characterized by high parental Favorability
and in the group characterized by low Favorability.
The median Favorability score was used
as the dividing line. Separate analyses were
carried out for mothers and for fathers. In each
low Favorability group, children receiving high
and low parental acceptance were compared
with respect to Self-Acceptance and Self-Favorability.
Similar comparisons were made
in each high parental Favorability group. The
Mann-Whitney U test was used in making all
statistical tests. The results of this procedure
can be described briefly: The one significant
difference occurred in the group characterized
by low Favorability of the mother. Here, as
expected from the hypothesis, children receiving
low Acceptance from the mother had
lower Self-Acceptance than did those receiving
high Acceptance from the mother. However,
the preponderance of nonsignificant differences
indicates that, in terms of the present data, the
hypothesized effects are slight at best, and
probably should be discounted completely
without further evidence.


TABLE 4
COMPARISON OF HIGH AND LOW SOCIOMETRIC GROUPS
ON SELF-EVALUATIVE VARIABLES
(U-test)

                        Self-favorability            Self-acceptance
Group    N     n*       Sum of ranks    p            Sum of ranks    p
Boys     34    15       221.5           [illegible]  160.5           .001
Girls    40    22       423.0                        466.0

* Number of cases in low sociometric group.

To study the relationship between children’s 
self-evaluations and their acceptability to their 
peers, the distribution of choices received in 
the sociometric procedure was first dichoto- 
mized for each sex separately. For each sex, 
the U test was employed to evaluate the dif- 
ference in Self-Favorability and Self-Accept- 
ance between the more and the less popular 
children. These results are presented in Table 
4. The popular boys express a significantly 
higher degree of Self-Acceptance than the 
unpopular boys. No significant differences
emerge for girls.
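
The boys' Self-Acceptance comparison in Table 4 can be reworked from the rank sum alone. The sketch below assumes that the tabled sum of ranks (160.5) belongs to the 15 boys in the low sociometric group (a group of 19 could not have a rank sum below 190), and it uses the large-sample normal approximation without a correction for ties; the function name is hypothetical.

```python
import math


def mann_whitney_from_rank_sum(rank_sum: float, n_group: int, n_total: int):
    """Return (U, z, two-sided p) for one group's rank sum in a two-group comparison."""
    n_other = n_total - n_group
    u = rank_sum - n_group * (n_group + 1) / 2
    mean_u = n_group * n_other / 2
    sd_u = math.sqrt(n_group * n_other * (n_total + 1) / 12)
    z = (u - mean_u) / sd_u
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return u, z, p_two_sided


u, z, p = mann_whitney_from_rank_sum(160.5, n_group=15, n_total=34)
print(u, round(z, 2), p < 0.001)
```

Under these assumptions U is 40.5 and the approximate two-sided probability falls below .001, in line with the level shown in the table. The same call with the Self-Favorability rank sum (221.5) gives a value well short of significance.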
DISCUSSION 

In interpreting the rather tenuous relation- 
ship observed between parents’ evaluations of 
children and the children’s self-evaluation, we 
should first take into account the problem of 
the validity of the measures used. While the 
children’s verbal responses can, if desired, be 
taken as operational definitions of their self- 
evaluative traits, the parental variables speci- 
fied by the theoretical position mentioned at 
the outset are those which reflect actual 
evaluative behavior by the parent toward the 
child over a span of several years. From this 
point of view, it is rather encouraging that an 
apparently significant relationship does emerge 
when only rating data are available from the 
parents. 

The contrast between the present low cor- 
relations and the relatively high correlations 
reported by Jourard and Remy (6) can be 
interpreted in two basically different ways. It 
is possible, as those authors note, that the 
children’s reports of their parents’ attitudes are 
distorted (possibly by response set) in such a 
way as to coincide spuriously with the chil- 
dren’s self-ratings. On the other hand, it is 
also possible that children’s reports are actually 
more valid as measures of parental evaluations 
than ratings by the parents themselves. This 
uncertainty would seem to call for validation 
of both types of measures against observed 
parental evaluative behavior. 

Regardless of the size of the true correlations 
between parental attitudes and children’s self- 
evaluations, no causal interpretations are justi- 
fied until temporal trends in the two variables 
can be studied. This type of study would repre- 
sent a logical step from the presently available 
evidence of a positive relationship between 
parental evaluations of the child and the 
child’s self-evaluations. Although psychody- 
namic interpretations are premature at the 
present time, it does seem wise, in view of the 
small size of the correlations found, to note that 
parental evaluations may have very great 
consequences for the child’s personality even if 
they are not reflected directly in the child’s 
self-evaluations. In fact, a study of the reac- 
tions of children whose parents evaluate them 
negatively might well shed much light on
defensive processes. 

The evidence that Acceptance and Favora- 
bility are substantially independent variables 
is rather difficult to interpret theoretically 
since the two types of measures were originally 
distinguished on methodological grounds and 
were expected to reflect the same attitudinal 
variables. It was suggested in comparing the 
effects of the various combinations of parental 
Favorability and Acceptance that Favorability 
reflects simply a description of an individual in 
evaluative terms while Acceptance indicates 
the attitude toward the individual with his 
perceived defects or assets. This differentiation 
would suggest that Acceptance is the more 
emotionally loaded of the two variables and 
has more significance for interpersonal relation- 
ships. While the data of the present study 
do not resolve this question, it can be noted 
that the single significant relationship between 
self-description and sociometric status in- 
volved the Acceptance variable. While the 
present results provide only the scantiest basis 
for interpretation of these two aspects of self- 
evaluations, they do point up the need for 
further methodological study of such con- 
structs as self-acceptance and self-esteem. 


SUMMARY 


Two self-evaluative measures, labelled Self- 
Favorability and Self-Acceptance, were de- 
rived from self-ratings by 74 8th- and 9th-grade 
children. Analogous measures of parental Fa- 
vorability and Acceptance with respect to the 
child were derived from ratings by parents of 
51 of the children. Sociometric information was 
available for the children. Although the meas- 
ures of Favorability and Acceptance had 
adequate reliability, correlations between the 
two variables were low in both parents and 
children. Correlations between parental evalu- 
ations and children’s self-evaluations tended 
to be small but consistently positive. The 
results were interpreted as evidence that 
attempts to increase the validity of assessments 
of parental attitudes and further to differenti- 
ate aspects of self-evaluations would prove 
rewarding. 


IDENTIFICATION IN GROUP THERAPY


GERALD J. BRISKIN 
Byron Harless and Associates, Miami, Florida 


The process of psychotherapy implies
changes in behavior, and the variables
associated with these changes have be-
come a focal point for research in therapy. In a
recent study, Schrier (11) found that identifica-
tion with the therapist in individual therapy
was significantly related to therapeutic success. 
This suggests that the ability to identify with 
the therapist is an important variable in the 
prognostic evaluation of candidates for ther- 
apy. The intent of this study is to investigate 
two of the variables contributing to identifi- 
cation. 

For the purposes of this paper, identification 
is operationally defined as changes in patient 
behavior from pretherapy status to behavior 
similar to that manifested by the therapist. 
The variables selected for study were rigidity 
and lability. 

Rigidity is a well-studied entity. Consider-
able research indicates that certain individuals
have difficulty in changing their patterns of
behavior. They seem encysted against the
world and behave in a stereotyped and inflex-
ible manner. Tension and anxiety are well
“bound” in this personality configuration, and 
emotional content is effectively removed from 
thoughts and ideas to the extent that anxiety 
is not strongly experienced by this kind of 
individual in therapy. Fenichel (5) suggests 
that identifications are always motivated by 
drives or tension. Since the rigid individual 
may not experience tension very readily, and 
since access to and from the ego is limited, it is 
quite likely that he will have difficulty in 
identifying. 

The term lability, used in this context, refers 
to a personality pattern that closely approxi- 
mates the preschizophrenic personality in 
which the ego boundaries are vaguely defined 
and weakly maintained. An individual pos-
sessing this kind of ego structure has difficulty
in distinguishing between self and non-self. 
Because of this, his reality-testing abilities are 
limited, and he often confuses events occurring 
in the external world with events occurring in 
himself. As a result, his ability to identify is 
severely impaired. 

Based upon this reasoning, two hypotheses 
were formulated: (a) There is an inverse 
relationship between rigidity and identifica- 
tion; (b) there is an inverse relationship between 
lability and identification. 


METHOD 


The Ss utilized in this study were ninth- and tenth-
grade male and female high school students. They were
selected for the experiment by asking members of 
the social living classes if they would like to volunteer 
to participate in groups that would discuss personal 
problems of adolescents. They were told that only a 
limited number could be selected and that they would 
be excused from class to participate. A total of 109 
students volunteered. All of the volunteers were then 
rated by their social living class teachers on a form 
designed to assess emotional adjustment. Those stu- 
dents who were rated most poorly adjusted were then 
eliminated from the study on the ground that they 
would be least likely to benefit from short term therapy. 

Three groups were selected to participate in therapy, 
each composed of five boys and five girls. Before the
therapeutic groups were formed, Ss were asked to fill
out a specially designed questionnaire,* and were given
group forms of the Rorschach. These forms and tests 
were administered by members of the research staff 
other than the therapists. 

Following this collection of data, the groups started 
in therapy, each group having a different therapist. 
The groups met twice a week over a period of four 
months, with interruptions for school holidays. Ss were
told that they were free to drop out of the groups up to 
the fifth session, but they were expected to stay in the 
group after the fifth meeting. There was no other pres- 
sure to compel attendance. 

The therapeutic approach followed group analytic 
principles, with the therapist very active in serving as 
the focal point of the group, yet being essentially per-
missive and accepting of all behavior, including fairly
extreme forms of acting out. This technique is based
on Wender’s (13) notions of group therapy and is
designed to elucidate and clarify defensive behavior and
its significance.

* To save printing costs, a 3-page tabular presenta-
tion of the questionnaire has been deposited with the
American Documentation Institute. Order Document
No. 5465 from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress, Wash-
ington 25, D. C., remitting $2.00 for microfilm or
$3.75 for photocopies. Make checks payable to Chief,
Photoduplication Service, Library of Congress.

At the conclusion of therapy, all Ss again filled out 
the same questionnaire that they had at the beginning 
of the study. These questionnaires were administered by 
the therapists. 

Each of the therapists was rated on this same ques- 
tionnaire by experienced third- or fourth-year clinical 
psychology graduate students who knew them well 
but who were not involved in any competitive relation- 
ships with them. These judges were instructed to 
answer the questionnaire as they believed the therapist 
would. A combined rating for each therapist was then 
obtained by averaging the judges’ ratings. 

Having the therapists rated by judges seemed a 
sounder procedure than having the therapist rate him- 
self or having him rated by his group members. With 
regard to the first alternative, the therapist, regardless 
of his insight, may still tend to rate himself defensively 
and thus present a distorted picture of himself. Cutler 
(4) made use of this reasoning in utilizing discrepancies 
between therapists’ self-ratings and observer ratings as 
an index of therapists’ areas of conflict. The other al- 
ternative, on casual examination, seems to have con- 
siderable merit. However, since Ss in therapy may be 
involved in all sorts of transference relationships with 
the therapist, any ratings of him by them may also 
be distorted. The use of external observers was therefore 
decided upon.

The questionnaire defining identification in therapy,
derived from Murray’s (8) inventory, was altered to fit
the needs of an adolescent group. It consisted of 50
statements, each of which could be answered always,
at times, seldom, or never. Each item was a description of
behavior or feelings, for example: “I feel I should like
to be a leader of a group,” or “I try to be good in home
and in school.”

The Rorschach test was essentially a group adapta- 
tion of the individual administration. The Harrower- 
Erickson Rorschach slides were used. The protocols 
were scored for Beck’s (1) locations, content, form 
levels, and populars and for Klopfer’s (7) determinants. 
Since group Rorschachs differ considerably from indi- 
vidual Rorschachs, especially as to the inquiry, it was 
necessary to adhere to certain additional rules: If a 
percept was named in the free association but there 
was no inquiry or location, it was not scored. If a 
percept was named in free association and there was a 
location but no inquiry, it was scored F only for de- 
terminants. Precision alternates were scored only once. 
Otherwise, the records were scored in accordance with 
the usual rules specified by Klopfer and Beck. No
reliability study of the scoring was done since numerous
reports in the literature (10, 12) indicate that high
interjudge reliability can be expected between
Rorschach scorers of considerable experience.

The measure of identification utilized in this study 
was operationally defined as the total number of shifts 
from each S’s pretherapy questionnaire to his post- 
therapy questionnaire in the direction of the therapist 
as rated by the observers. The measures of lability 
and rigidity were derived from the Rorschachs. How-
ever, since the determinants obtained from a group
administration of the Rorschach vary considerably
from those obtained in individual administration, it 
was not possible to apply standard interpretive pro- 
cedures to the Rorschach records without first 
evaluating the data. In order to accomplish this evalu- 
ation, the Rorschachs for the volunteers who did not 
receive therapy were scored and distributions made of 
all determinants. Since most of these distributions were 
greatly skewed, medians and quartile deviations were 
calculated rather than means and standard deviations.
These statistics were then used as standards in setting
up the rating scales.
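
The identification measure defined above (the number of pre-to-post questionnaire shifts toward the therapist's combined rating) can be sketched as follows. This is only an illustration under stated assumptions: the numeric coding of the four response categories, the function names, and the rule that a "shift in the direction of the therapist" means a post-therapy answer closer to the judges' averaged rating than the pretherapy answer are all hypothetical and are not specified in the text.

```python
from typing import Sequence

# Assumed numeric coding of the four response categories (not given in the source).
CODES = {"always": 4, "at times": 3, "seldom": 2, "never": 1}


def identification_score(pre: Sequence[str], post: Sequence[str],
                         therapist: Sequence[float]) -> int:
    """Count items on which S's answer moved closer to the therapist's rating.

    `therapist` holds the judges' averaged rating per item on the same
    assumed 1-4 coding.
    """
    if not (len(pre) == len(post) == len(therapist)):
        raise ValueError("all three sequences must cover the same items")
    shifts = 0
    for before, after, target in zip(pre, post, therapist):
        distance_before = abs(CODES[before] - target)
        distance_after = abs(CODES[after] - target)
        if distance_after < distance_before:  # shift toward the therapist
            shifts += 1
    return shifts


def total_shifts(pre: Sequence[str], post: Sequence[str]) -> int:
    """Count every changed item, regardless of direction."""
    return sum(1 for before, after in zip(pre, post) if before != after)
```

Keeping the directional count separate from the undirected count makes it possible to check whether a low identification score merely reflects few changes of any kind.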

However, Rorschach variables do not scale pro- 
portionately; a total of four M responses, for example, 
is not twice as significant as two M responses. There- 
fore, in assigning weights to Rorschach variables, 
cutting points must be determined in accordance with 
the shape of the distribution of the variables dealt with.
Furthermore, since most Rorschach work has been done 
with individually administered tests, the “accepted” 
cutting points must also be modified in terms of the 
group administered distributions. An example follows: 

It is a standard Rorschach procedure to regard
a protocol in which F comprises more than fifty per
cent of the responses as constricted. But the median
F percentage for the group records was 74.5. Two
explanations are available: The population dealt
with produced unusually frequent F percepts, or the
lack of an individually conducted inquiry resulted
in many responses being scored F which otherwise
would not have been so scored. Since the latter ex-
planation seems more tenable, the notion of the
cutting point for constriction as 50 per cent was
discarded, and the cutting point used was based on
Q3, which was 88. Thus, a weighted scale for con-
striction could be established with scores of 87 per
cent or below weighted at zero, 88-95 per cent at
one, and 96 per cent or over at two.
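
The constriction weighting in the example above amounts to a simple step function of F per cent. A minimal sketch, assuming a hypothetical function name and assuming that fractional percentages are compared directly against the 88 and 96 cutting points (the text states the bands only for whole percentages):

```python
def constriction_weight(f_percent: float) -> int:
    """Map an F percentage to the weight given in the worked example.

    87 per cent or below -> 0, 88-95 per cent -> 1, 96 per cent or over -> 2.
    Treatment of fractional values between bands is an assumption.
    """
    if not 0 <= f_percent <= 100:
        raise ValueError("F per cent must lie between 0 and 100")
    if f_percent >= 96:
        return 2
    if f_percent >= 88:  # Q3 of the group-administered records
        return 1
    return 0
```

Under this scheme the group median of 74.5 per cent receives a weight of zero, so only records in the upper quartile contribute to the scale.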

Based on this procedure, the rating scales for lability 
and rigidity were developed. To determine the rigidity 
or lability scores for a particular S, the determinants 
from his Rorschach were given the appropriate factor 
weights as indicated in the respective scales, and these 
weights were then totaled. Since some negative scores 
were obtained, all of the scores had a constant added 
to them so that positive numbers could be dealt with.

The measure of rigidity was based upon pertinent 
theoretical and clinical positions as found in the litera- 
ture (2, 6, 7, 10). One study of particular relevance by 
Cowen and Thompson (3) related certain Rorschach 
findings to measures of Einstellung rigidity. The rigidity 
index may be found in Table 1. 

As discussed previously, the measure of lability is 
closely related to what has been described as the pre- 
schizophrenic personality. Consequently, the scale 
measuring this configuration was derived from standard 
Rorschach “signs” (2, 6, 7) considered as indicative of 
such a personality. These signs and their weights 
are given in Table 2. 

The interobserver agreement of ratings of the
therapist can be found in Table 3. These agreements
are given in terms of coefficients of contingency rather
than the usual product-moment correlations. This
statistical treatment was necessitated by the nature of
the data. Ratings on the questionnaire could be made
only in terms of four categories, always, at times, seldom,
never. Since the distributions of these ratings were
skewed, and since product-moment correlations are
attenuated by a limited range (only four categories in
this sample), the usual correlational techniques were
inappropriate and, consequently, the contingency
coefficient was substituted. A fourfold table was used,
grouping always and at times in one category and seldom
and never in the other.


TABLE 1
THE RIGIDITY RATING SCALE

Factor
I. Total Responses
    11 or less
    27 or more
II. D plus Dd%
    85 or more
    25 or less
III. Sum M
    0
    1
    2 or more
IV. F%
    85 or more
    50 or less
V. F+%
    90 or more
    60 or less
VI. A%
    60 or more
    30 or less
VII. Number of Content Categories
    4 or less
    9 or more
VIII. Populars
    6 or more


TABLE 2
THE LABILITY RATING SCALE

Factor                                          Weight
I. Sum CF plus C greater than FC
   (FC = .5, CF = 1.0, C = 1.5)
    by 0-.5
    by 1.0-2.0
    by more than 2.0
II. Total M
    0
    1
    2 or more
III. F% plus F+%
    F% greater than 80,
    F+% less than 65
IV. Total FM
    0-1
    2 or more
V. M quality
    For each flexor
    For each extensor
VI. Pure H% (does not include human-like
    percepts)
    15 or less
    30 or more
VII. Card Rejections
    0-1
    2 or more


TABLE 3
COEFFICIENTS OF CONTINGENCY*** FOR INTERJUDGE
AGREEMENT IN RATING THERAPISTS

Therapist A    Therapist B    Therapist C

.36*
.40**

* .05 level of confidence.
** .01 level of confidence.
*** Upper limit of coefficient of contingency = .7071.
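
The contingency coefficient for a fourfold table, and its upper limit of .7071 noted under Table 3, can be illustrated with a short sketch. The formula used is the standard Pearson coefficient, C = sqrt(chi-square / (N + chi-square)). The function names are hypothetical, and computing chi-square without a continuity correction is an assumption, since the text does not say whether one was applied.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square for the fourfold table [[a, b], [c, d]] (no correction)."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one observation")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


def contingency_coefficient(a: int, b: int, c: int, d: int) -> float:
    """Pearson's coefficient of contingency C for a fourfold table."""
    chi2 = chi_square_2x2(a, b, c, d)
    n = a + b + c + d
    return math.sqrt(chi2 / (n + chi2))
```

With perfect agreement between two judges (all items on the diagonal) chi-square equals N, so C reaches sqrt(1/2), which is about .7071. This is why the obtained coefficients have to be read against that ceiling rather than against 1.00.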


RESULTS 

The hypothesis specifying the relationship 
between identification and rigidity was tested 
by correlating the identification scores of Ss 
in the therapy groups with their rigidity scores. 
A product-moment correlation of —.73 was 
obtained, significant at the .01 level. In order 
to test the possibility that rigid individuals 
might show less identification with the thera-
pist simply by making fewer shifts on the
questionnaire, regardless of direction, than non-
rigid Ss, the total number of shifts for each S
was correlated with his respective rigidity
score. The product-moment correlation was
.09. A t test of the difference between the
mean numbers of shifts for the rigid and non-
rigid groups was similarly not significant. The
possible objection therefore seems irrelevant, 
and the findings may be interpreted as sup- 
porting the hypothesis. 

The hypothesis specifying the relationship 
between identification with the therapist and 
lability was also tested by correlating the 
identification score of each individual in 
therapy with his lability score. A product- 
moment correlation of —.64 was obtained, 
significant at the .01 level. This finding is also 
interpreted as supporting the hypothesis. 

DISCUSSION 

The primary intent of this study was to make 
an exploratory attempt to evaluate two hy- 
potheses derived from psychoanalytic theory as 
they apply to group therapy. In general, both 
hypotheses were supported. These findings 
lend emphasis to the role that ego structure 
plays in identification and to the fruitfulness of 
research on ego functions. 

Previous studies have suggested that the 
ability of patients to identify with the thera-
pist markedly increases the probability of
therapeutic success. Since there are far more 
prospective patients than the limited number 
of psychotherapists can accommodate, the 
selection of candidates for therapy is generally 
made on the basis of an estimated ability to 
benefit from treatment. In this respect, the 
present findings imply potentially useful cri- 
teria for the prediction of identification, which 
may be a highly relevant factor in prognostic 
evaluation. 


SUMMARY 

The Ss of this study were male and female 
adolescents (12-15 years of age) who were 
asked to volunteer to participate in groups that 
“would discuss adolescent problems.” Those
who volunteered were rated by their teachers 
on emotional adjustment. Ss rated as most 
poorly adjusted were eliminated from the 
study. Three therapy groups were organized, 
each composed of five boys and five girls, 
each in the charge of a different therapist. The 
members of the group were given a special 
questionnaire, designed to assess needs, and a 
group form of the Rorschach before therapy 
began. At the conclusion of therapy, all Ss 
again took the questionnaire. Two or more 
judges rated each therapist on the question- 
naire items. 

A measure of identification was obtained by 
taking the number of shifts by the Ss from 
pre- to posttherapy questionnaires which were 
in the direction of the therapist as defined by 
the combined observer ratings. Measures of 
lability and rigidity were derived from the 
Rorschach tests. 


The hypotheses that both rigidity and la- 
bility were inversely related to identification 
were tested by correlational methods. Results
were positive in both instances. 


SOME RELATIONSHIPS AMONG PROJECTIVE TEST, INTERVIEW,
AND SOCIOMETRIC MEASURES OF DEPENDENT BEHAVIOR¹

BERNARD J. FITZGERALD²

VA Hospital, Roanoke, Virginia 


The need for experimental evaluation of
projective test data as used for de-
scribing and evaluating behavior has
frequently been pointed out (4, 7). Some 
authors disparage such experimental evalua- 
tion, suggesting that perhaps clinically useful 
evidence is distorted or lost in the process of 
quantification (2, 3). However, it would seem 
that systematic investigation of projective test
data is essential, particularly in view of the
failure of many studies to predict successfully
from projective test responses to overt beha-
vior, even on the basis of clinical evaluation of
the data. 

The present study was developed using de- 
scriptive constructs from Rotter’s social learn- 
ing theory (6). From the point of view of social 
learning theory, any given situation has within 
it possibilities for the satisfaction of many dif- 
ferent needs. Which of these needs an individ- 
ual will strive to satisfy will depend upon two 
factors, the value that the satisfaction of a
given need has for him and his expectancy that 
his behavior will lead to that satisfaction. The 
value which the satisfaction of a given need has 
for the individual is termed Need Value. In the 
measurement of Need Value, emphasis is upon 
what need satisfactions the individual prefers. 
An individual’s expectation that particular
behaviors will lead to the desired need satis-
faction is termed Freedom of Movement.
Emphasis is upon what the individual expects 
to happen. Freedom of Movement is high if 
the individual expects his behaviors to lead to 
need satisfaction. An individual’s Freedom 
of Movement is low if he expects failure or 
punishment for such behaviors. One method 
of measuring Freedom of Movement depends 
upon identifying the frequency with which the 
individual, when seeking any particular need 
satisfaction, resorts to symbolic rather than
direct, overt behavior to obtain that satisfac-
tion; the more he resorts to symbolic behavior,
the lower is his Freedom of Movement.

¹ This paper is based on a doctoral dissertation done
at the Ohio State University. The author acknowledges
with gratitude the advice and guidance given him by
J. B. Rotter, under whose direction this study was
carried out. The writer also is grateful to Gordon Rader
for his helpful criticism of this manuscript.

² Now at Philadelphia State Hospital.

In terms of this study, it was hypothesized 
that if projective test responses represent, in 
part, symbolic ways of obtaining gratification, 
then frequency of response should be highest 
when Freedom of Movement is low and Need 
Value high. This hypothesis assumes that a 
projective test provides a situation in which 
symbolic gratification is possible; therefore, 
the individual tends to give responses involv- 
ing satisfactions that have a high value for 
him but for which he has a low expectancy of 
realization. Such a situation, where Need 
Value is higher than Freedom of Movement, 
is considered in this study to be a situation 
of important conflict. 

The purpose of this study was to provide 
data related to the interpretation of needs as 
reflected in the content of projective test re- 
sponses. Dependency, the need investigated, 
may be defined as the need to have other 
individuals prevent frustration or punishment 
and provide for the satisfaction of other 
needs. The projective tests used were the 
Murray TAT and the Rotter Incomplete 
Sentences Blank (ISB). The measures of overt 
behavior were a sociometric technique and an 
interview. The measures of Need Value and 
Freedom of Movement for Dependency were 
also derived from the interview. 

The hypotheses of this study are as follows: 

1. Frequency of n Dependency responses in 
a projective test is positively related to socio- 
metric and interview indices of Dependent 
behavior; and 

2. Frequency of n Dependency responses in 
a projective test is positively related to the 
importance of conflict in n Dependency. 


METHOD 


Eighty Ss, 20 from each of four social fraternities,
participated in the study. All Ss had been members of
their respective fraternities for at least six months and
students at the Ohio State University for at least one 
year. A pilot study was devoted to the development 
of the interview, the sociometric measure, and judges’ 
manuals for scoring interview and projective test
responses. 

The criteria of overt behavior were a sociometric 
index and an interview measure of Dependent be- 
havior. A nominating technique was used as the socio- 
metric measure. Two descriptions were provided the 
Ss in order to make their nominations. One of these 
descriptions was worded to describe certain behaviors 
considered most and another worded to describe be- 
haviors least representative of Dependent behavior. 
Care was taken not to label the descriptions as repre- 
sentative of any particular kind of behavior, e.g., De- 
pendent. Ss were asked to select from the members of 
the group those who, in general, best fit one or the 
other of the two descriptions. The nominations were 
made alternately, i.e., the one S who behaved most 
Dependently, the one S who behaved most non-De- 
pendently, and so on. In this manner, then, all Ss made 
three nominations for each description.*

A structured clinical interview was developed to 
elicit from these same Ss historical data relevant to 
their Dependent behavior and data relevant to their 
Need Value and Freedom of Movement for Dependency 
satisfaction. The interview was divided into eleven 
areas: school, home, advisor, major academic subject, 
roommate, friends, girl friends, health, personal prob- 
lems, and religion. A manual for scoring the interview 
variables of Dependent behavior, Need Value, and 
Freedom of Movement for Dependency satisfaction 
was developed. Included in the manual were the defini- 
tion of Dependency and a set of general principles 
derived from the definition, as well as examples from 
interview protocols obtained in the pilot study. Ex- 
amples were derived from these interviews to correspond 
to the points 1, 4, 7, and 10 on a ten-point rating scale. 
The in-between points were included to allow finer 
discriminations than were possible from the selected 
referents in the manuals. The values attached to the 
examples in each of the manuals were arrived at by 
the experimenter’s first assigning crude values con-
sistent with the context from which the examples were
taken. Two independent raters also assigned values to 
these examples, and the scale values were adjusted to 
maximize agreement among the three raters. Finally, 
these refined scales were discussed with the interview 
judges. Changes were made in the scales where neces- 
sary for complete agreement among the judges, and the 
examples were then assigned final values. 

Judges’ scoring manuals were similarly developed 
for the TAT (Cards 3BM, 4, 6BM, 7BM, 8BM, 10, 
12M, 13MF, 15, 17BM, 18BM, 20) and the ISB. From 
the TAT stories obtained in the pilot study, a manual 
was prepared to aid in the determination of the presence 
or absence of a Dependency plot for any given story. 
In the development of the manual, each pilot protocol
was carefully read, and any story which could be con-
sidered to have a Dependency plot was noted. Two
projective test judges and the experimenter discussed
those stories which were “borderline” or difficult to
judge for n Dependency. Following discussion of each
borderline story, the three judges agreed on each story
as representative of either Dependency or non-De-
pendency. These stories, as well as stories clearly repre-
sentative of n Dependency, were included in the final
form of the manual. Dependency plots were identified
in relation to both the Hero and the Other Figures.
Suggestions were included in the manual to promote
agreement among the judges in the determination of
the Hero figure. From ISB responses obtained in the
pilot study, as well as from a large pool of items gathered
previously by others, a manual for scoring ISB responses
for n Dependency was developed in a similar fashion.
In making their ratings, the judges were asked to use
the manuals by comparing the TAT or ISB item to be
rated with examples included in the manuals. Approxi-
mately 30 hours were spent in training the judges in
the use of the manuals.

* The method of nominating only the extremes of a
group has been found to be at least as reliable as rating
an entire group and to provide a distribution of ratings
similar to that obtained when each member of a group
is to be rated by every other member.


Procedure 


The Ss were first seen at their respective fraternity 
houses, where the sociometric and projective measures 
were obtained. Following the group’s completion of 
the sociometric ratings, the TAT cards were projected 
on a screen. Ss were allowed six minutes, the TAT card 
remaining in view, to write each story. Finally, the 
ISB was administered. Each S was interviewed indi-
vidually within a two-week period following the group
testing. All interviews were recorded on the Gray 
Audograph, and the judges made their ratings of the 
interviews from these recordings. The experimenter and 
two interview judges, working independently, rated the
interviews. The experimenter and two different judges, 
working independently, rated the projective test re- 
sponses. The protocols and interviews were coded so 
that the identity of the individual Ss was not known 
to the judges. All TAT and ISB responses for any 
given card or item were rated before continuing to the 
next item, and all responses were rated by at least two 
judges. 

Estimates of the interrater reliabilities for the 
interview variables, the TAT, and the ISB were ob- 
tained by correlating (Pearson r) the ratings obtained 
from each judge with those of each of the other two 
judges. The interview variables were each rated on a 
ten-point scale. For the TAT and the ISB, each S was 
given a score by counting the number of stories or items 
rated as Dependent. The average interrater correlations 
for the interview were as follows: Dependent behavior, 
.54; Need Value, .45; Freedom of Movement, .56. The 
average interrater correlations for the projective tests
were: TAT (Hero), .75; TAT (Other), .11; and ISB, .78. Be-
cause of the unreliable TAT Other Figure ratings, this
index was dropped from consideration. The sociometric
ratings yielded a corrected split-half reliability of .83.
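
The reliability estimates described above can be sketched in a few lines. This is an illustration, not the author's computation. Two assumptions are made: the "average" interrater correlation is taken as the plain arithmetic mean of the pairwise Pearson r values, and the "corrected" split-half figure is taken to use the Spearman-Brown formula, which is the customary correction but is not named in the text. All function names are hypothetical.

```python
import math
from itertools import combinations
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Product-moment correlation between two equally long rating vectors."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two vectors of equal length >= 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant ratings")
    return sxy / math.sqrt(sxx * syy)


def mean_pairwise_r(ratings: Sequence[Sequence[float]]) -> float:
    """Average of Pearson r over every pair of judges (simple mean, assumed)."""
    pairs = list(combinations(ratings, 2))
    return sum(pearson_r(a, b) for a, b in pairs) / len(pairs)


def spearman_brown(r_half: float) -> float:
    """Step up a half-test correlation to full length (assumed correction)."""
    return 2 * r_half / (1 + r_half)
```

With three judges, `mean_pairwise_r` averages the three judge-by-judge correlations, which matches the procedure of correlating each judge's ratings with those of each of the other two.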


RESULTS 


A test of the first hypothesis concerning the 
relationship between projective test responses 
and overt Dependent behavior was made by 
correlating (Pearson r) the interview and the 
sociometric ratings of Dependent behavior 
with the frequency-of-response ratings on the 
TAT and on the ISB. It appears from Table 1
that there is no evidence of a relationship be-
tween frequency of response in n Dependency
on the TAT and either the interview or the
sociometric measures of Dependent behavior.
The ISB, in terms of frequency of response in
n Dependency, yields indices which are re-
lated to both at low but significant magni-
tudes.

The second hypothesis concerning the rela-
tionship of frequency of response in n Depend-
ency to importance of conflict in n Depend-
ency, as determined from the interview, was
evaluated by the chi-square test. In order to
make this test, Ss were divided into two
groups for each projective test respectively,
i.e., those Ss with a high number (four or more
on the TAT; seven or more on the ISB) vs.
those with a low number (three or less on the
TAT; six or less on the ISB) of Dependency
responses. The high and low groups for each
test were compared on the basis of an im-
portance-of-conflict score which was derived
in the following manner: Ss with Freedom of
Movement equal to or greater than Need
Value were considered to have “No Important
Conflict” in n Dependency and were com-
pared with those Ss whose Freedom of Move-
ment was less than Need Value. These latter
Ss were considered to have an “Important
Conflict” in n Dependency. The results of
this comparison are reported in Table 2. It
appears that those Ss who have a relatively
higher Need Value than Freedom of Move-
ment for Dependency satisfaction give sig-
nificantly more responses in n Dependency on
the TAT. The results for the ISB are not
significant.

It seems evident also from Table 2 that the
TAT tends to elicit dissimilar data, in terms
of a frequency-of-response measure, from that
elicited by the ISB. Comparisons of these two
tests in terms of frequency of response in n
Dependency as related to interview and socio-
metric measures of Dependent behavior
(Table 1) support this conclusion. Further
evidence that these two instruments tend to
elicit dissimilar data is given by their low
intercorrelation (.09).

On the other hand, the correlation between
ratings of overt Dependent behavior from the
interview and from sociometric measures was
.59. Important to note here is the potential
usefulness of the economical sociometric
technique in relation to the laborious clinical
interview as a quantified index of certain
kinds of overt behavior.

TABLE 1
CORRELATIONS BETWEEN THE MEASURES OF DEPENDENT
BEHAVIOR AND FREQUENCY OF RESPONSE RATINGS
ON THE PROJECTIVE TESTS
(N = 60)

                 TAT      ISB
Interview        .07      .28*
Sociometric      .06      [illegible]

* p = .05

TABLE 2
CHI SQUARE OF THE FREQUENCY OF DEPENDENCY
RESPONSES ON THE PROJECTIVE MEASURES
COMPARED TO CONFLICT IN NEED DEPENDENCY
(N = 60)

                                   Total Number of Dependency Responses
                                         TAT                  ISB
                                   High D    Low D      High D    Low D
                                   Group     Group      Group     Group
“Important Conflict”                 88        27        183        79
  (Freedom of Movement less
  than Need Value)
“Not Important Conflict”             16        18         65        37
  (Freedom of Movement equal to
  or greater than Need Value)

                                   χ² = 12.24;          χ² = .99;
                                   p = .001             p = .30
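
The grouping rules behind the chi-square comparison can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function names are hypothetical, and because the text does not say whether a continuity correction was applied, the uncorrected Pearson chi-square is assumed here. The counts in the usage note are invented for the example.

```python
def conflict_group(freedom_of_movement, need_value):
    """Classify an S by the rule in the text: FM below NV means an important conflict."""
    if freedom_of_movement < need_value:
        return "Important Conflict"
    return "No Important Conflict"


# Cutoffs stated in the text: four or more on the TAT, seven or more on the ISB.
HIGH_CUTOFF = {"TAT": 4, "ISB": 7}


def response_group(n_dependency_responses, test_name):
    """High group if the S reaches the cutoff for the named test, else Low."""
    cutoff = HIGH_CUTOFF[test_name]
    return "High" if n_dependency_responses >= cutoff else "Low"


def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("chi-square is undefined when a margin is zero")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Hypothetical counts for illustration only (not the values in Table 2).
example_chi_square = chi_square_2x2(20, 10, 10, 20)
```

For a 2 x 2 table the statistic has one degree of freedom; a table with identical rows yields zero.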

CONCLUSIONS AND DISCUSSION 

The following conclusions, all in terms of 
frequency of response in n Dependency, may 
be drawn from this study: 

1. There is no evidence that TAT responses 
are related to interview or sociometric indices 
of Dependent behavior. 

2. TAT responses are significantly related 
to an interview measure of importance of 
conflict in n Dependency. 

The foregoing conclusions lend support to
two assumptions listed by Lindzey in his
review of the clinical use of the TAT. One
assumption states, “The dispositions and
conflicts that may be inferred from the
story-teller’s creations are not always reflected
directly in overt behavior.” Another holds
that, “Themes that are recurrent in a series
of stories are particularly apt to mirror the
impulses and conflicts of the story-teller” (4,
p. 3). The similarity of these assumptions to
the hypotheses in this study is evident.

3. ISB responses tend to be related to
interview and sociometric measures of overt
Dependent behavior.

4. ISB responses show no evidence of rela- 
tionship to an interview measure of impor- 
tance of conflict in n Dependency. 

The third conclusion is similar to that of
McClelland (5), who, using a Sentence
Completion Test to measure n Achievement,
argued that this test provides results which
are dissimilar to those provided by the TAT
and which are more similar to conscious
self-judgments. Blyth (1) found also that the ISB
differentiated between groups of veterans who
behaved dependently or independently in
terms of accepting or rejecting offers of
psychotherapy.

It is evident that the TAT and the ISB,
although both projective tests, provide
dissimilar data in terms of frequency of response
in a given need category, and that inferences
from one test cannot be generalized to the
other. More specifically, where inferences
about overt behavior are to be made, a highly
structured, behaviorally oriented projective
test, such as the ISB, may be the more
appropriate. The TAT does not appear to be
particularly useful as a source of inferences about
overt behavior. Where inferences about
conflict in a given need area are to be made, a
projective test that allows considerable
freedom of response, such as the TAT, is
probably appropriate. The ISB appears to be less
appropriate as a source of inferences about
conflict in a given need.

The unmodified TAT and the ISB were 
used in this study as they generally are em- 
ployed in clinical psychological work. It is 
important to emphasize again that the meas- 
ure used in relation to the projective test re- 
sponses was a frequency-of-response measure. 
It is not intended to imply that such a meas- 
ure of projective test responses is the only 
possible one or that such a measure is more 
valid or economical than other possible uses 
of such projective test responses. In fact, it is
evident that a frequency-of-response measure
makes no more than minimal use of the data
provided by the TAT and the ISB.
Nevertheless, with descriptive constructs from Rotter’s
social learning theory, which provided for an
objective and communicable method, a
predictive relationship was demonstrated between
frequency of response in n Dependency on the
TAT and an interview measure of importance
of conflict in n Dependency.


SUMMARY 


Data relevant to two hypotheses concerned 
with the evaluation of projective test responses 
were obtained in this study. The projective 
test measure was frequency of response in n 
Dependency on the TAT and on the ISB. A 
structured clinical interview and a sociometric 
technique were used as measures of overt 
Dependent behavior. The clinical interview 
also was structured to provide measures of 
Need Value and Freedom of Movement for n 
Dependency. Eighty Ss participated in the 
study. A pilot study with 20 of these Ss pro- 
vided data for judges’ scoring manuals for the 
interview and projective test situations. All 
Ss were tested in groups and interviewed in- 
dividually. 

Results of the study may be summarized as 
follows: 

1. Frequency of response in n Dependency
on the TAT yielded no evidence of relationship
to the interview and sociometric measures
of overt Dependent behavior.

2. Frequency of response in n Dependency 
on the ISB was related to interview and 
sociometric measures of overt Dependent 
behavior. 

3. Importance of conflict in n Dependency, 
as derived from the interview, was related to 
frequency of response in n Dependency on the 
TAT. The results for the ISB were not sig- 
nificant. 

4. Dissimilar data, in terms of frequency of 
response in n Dependency, were elicited by 
the TAT and the ISB. 

5. A high degree of similarity was found 
between the sociometric and interview ratings 
of overt Dependent behavior. This suggests 
the usefulness of sociometric ratings in lieu of 
clinical interviews to obtain an approximate, 
quantifiable measure of certain kinds of overt 
behavior.

THE REPRESSION HYPOTHESIS STUDIED IN A SITUATION OF
HYPNOTICALLY INDUCED CONFLICT

In recent years, a number of experimental
psychologists have tried to rewrite
psychoanalytic theory in operational terms.

The concept of repression has stimulated a
variety of studies (13, 16, 21, 22, 23, 24, 25)
that have succeeded primarily in pointing up
methodological problems but have contributed
little to our knowledge of repression as either
a phenomenon or a process. This paper reports
a study made ten years ago (3) that still seems
pertinent today.

Repression, as a method of ego defense,
protects the individual from painful emotional
states by keeping ideas that represent certain
impulses out of consciousness (7, p. 86).
Following Freud’s formulation, we may identify
the necessary conditions for the development
of repression as these: (a) An instinctual
impulse is aroused; (b) gratification of this
impulse is in some way unacceptable to the ego of
the individual; (c) pain (anxiety) has been
experienced as the ego demands have been
frustrated in gratifying the instinctual
tendency (7, p. 84; 9, p. 119); and (d) the need to
avoid this anxiety is stronger than the
instinctual drive (7, p. 85).

As a process, repression as conceived by 
Freud has the following dynamic characteris- 
tics: 

1. The repressed is only separated from 
consciousness (awareness) and is presumably 
still active in the unconscious. In fact, the 
extent and degree of repression varies as in- 
hibitory and instinctual propensities fluctuate 
(7, pp. 89-90). 

2. Generalization of cues and responses
often occurs. Cues originally related to
the drive may be repressed, but other
representations may be admitted to consciousness
and acted upon with the accumulated energy
of the repressed (7, pp. 86-87). Or, the related
ideas may become so associated with the
repressed that they must also undergo repression,
if the individual is to avoid anxiety (Freud’s
“repression proper,” 7, p. 86). If repressed
impulses are satisfied, it must be through
substitute activities. Unless such satisfactions are
complete, the need persists, to be expressed
through an increasing number of substitute
ideas and with “the damming-up consequent
on lack of real satisfaction” (7, p. 87).

1 The investigation was conducted under the
direction of Professor Charles Strother; technical
assistance was given by Drs. Wilbur R. Miller and
Paul Huston of the Psychopathic Hospital, and by
Professors Robert R. Sears and E. E. Lindquist of the
State University of Iowa. The author now resides at
3900-50 N.E., Seattle, Washington.

Since “the motive and purpose of repression
is simply the avoidance of ‘pain’” (7, p. 92),
it follows that repression may be considered
successful only to the extent that it does avoid
anxiety or pain. It has been noted by Freud
(10, pp. 21-22) and by Merrill (16) and Zeller
(23) that unsuccessful, i.e., partial repression,
or repression in which at least the affective
component has become attached to a substitute
idea is most amenable to analysis or
experiment. These emotional components become
apparent when one considers the dynamics of
psychoanalytic therapy in which the aim is to
remove repressions. Freud says, “... the
patient can go on spinning a whole chain of ...
associations, till he is brought up in the midst
of them against some thought-formation, the
relation of which to what is repressed acts so
intensely that he is compelled to repeat his
attempt at repression” (7, p. 88). Finally,
summarizing the formula for therapy, “... all
repressions are to be undone; the mental
condition is then the same as if all amnesias are
removed. When all gaps in memory have been
filled in, all the enigmatic products of mental
life elucidated, the continuance and even the
renewal of the morbid condition is impossible”
(8, p. 269).

In this theoretical context, this investigation
is specifically concerned with anxiety behavior
as the response variable to be measured as a
function of degree (completeness) of repression.
In terms of the general hypothesis, and apart
from therapeutic intervention, the following
relationships are predicted: In the absence of
either conflict or repression, there should be no
anxiety behavior. In conflict with full
repression, there should be no anxiety behavior
except as the repressive process fluctuates.
With incomplete repression, there should be an
increase in the scope and intensity of anxiety
disturbance. As repression ceases to operate,
disturbance should reach a maximum and show
considerable generalization. Finally, when both
repression and conflict are removed there
should no longer be anxiety behavior related
to the conflict.

Most of the investigations of repression have
utilized some measures of memory as the
response variable. Zeller (23, pp. 40-45) recently
provided an excellent critical review of these,
pointing out the fallacies and inadequacies of
each. The kind and amount of feeling tone,
emotional disturbance, or ego threat in the
experimental situation are seen as important
factors. He also stresses that any critical study
must demonstrate “restoration to
consciousness of the repressed material” upon removal
of the repression factor (23, p. 46).

Zeller (23, p. 44) and other writers (16, 22)
have noted the Huston, Shakow, and Erickson
experiment (13) as one of the better
demonstrations of repression, although the use of
hypnosis makes interpretation somewhat difficult.
The criticism in terms of dependence on
hypnosis loses some weight when we consider that
results in conformity with the repression
hypothesis occur as predicted, and that these
results could not have been predicted on the
basis of hypnosis alone. The hypnotic approach
surely warrants further investigation, if only
for the fact that other approaches thus far
devised involve variables perhaps even more
difficult of appraisal, i.e., learning, motivation,
reminiscence, retroactive inhibition, etc. (24,
p. 422).

A further consideration in experimental
design is the response to be measured as a
function of repression. In the case of most of the
experiments measuring memory responses,
“anxiety” or some form of personal
involvement (frustration, incompleteness,
unpleasantness) is inferred. In the study of Huston,
Shakow, and Erickson (13), however, the
responses measured are presumed on the basis
of tests of validity to be measures of emotional
disturbance or anxiety. The condition of
repression is of course also independently
validated.




METHOD 


This experiment employs hypnotically
induced conflicts as described by Erickson (6)
tested according to Luria’s association test
procedures (13, 15). Experimentally induced
conflicts offer opportunity for necessary
control. Except when hypnosis has been employed,
however, studies of conflict in human subjects
(Ss) have been too circumscribed and moderate
for our purposes. The hypnotic method has
been found to have some validity, i.e., the
experimenter can define the nature of the conflict,
assure the S’s susceptibility to and acceptance
of it, suggest in general the nature of his
reaction to it, create the effects of repression and,
finally, remove the hypnotic effects without
undesirable residual reactions.

Luria’s method for testing emotional dis- 
turbance calls for conditioning a voluntary 
motor pressure of the preferred hand to occur 
simultaneously with a verbal response. The 
theory is that an integrated response of low 
habit strength under voluntary control is 
notably susceptible to disorganizing effects of 
emotional disturbance. His report of the gen- 
eral validity of the method has been verified 
in a group of studies (1, 4, 5, 11, 12, 13, 18, 
19, 20). 

The design of this experiment is as follows:
Under hypnosis, an incident created to arouse
a severe state of conflict is related to six
selected Ss. In the waking state, responses to a
standard set of 100 association test words are
recorded under five experimental conditions:
(a) before the conflict is induced; (b) after
induction of the conflict and with posthypnotic
amnesia for it; (c) with partial posthypnotic
amnesia; (d) with full awareness of the conflict
as an immediate actuality; and (e) after
hypnotic removal of the conflict. The association
test is composed of “critical” words relating
directly to the conflict story; “postcritical”
words immediately following critical words,
and “neutral” words to which the S’s
responses were in no way disturbed on a
preliminary testing. The three experimental variables
are thus subjects (assumed to contain a random
effect), types of verbal stimuli and conditions
of conflict awareness. The simple factorial
design permits evaluation of the three
variables in all combinations by means of analyses
of variance.
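
The layout of this factorial design can be sketched by enumerating its cells. This is an illustration only: the subject labels S1 to S6 are hypothetical, and the degrees of freedom assume one summary score per subject, stimulus type, and condition, which the text does not state explicitly.

```python
from itertools import product

SUBJECTS = [f"S{i}" for i in range(1, 7)]           # six Ss (labels are hypothetical)
STIMULI = ["critical", "postcritical", "neutral"]  # types of verbal stimuli
CONDITIONS = ["I", "II", "III", "IV", "V"]          # conditions (a) through (e)

# One cell per subject x stimulus type x condition combination.
design_cells = list(product(SUBJECTS, STIMULI, CONDITIONS))


def degrees_of_freedom(n_subjects, n_stimuli, n_conditions):
    """Degrees of freedom per source in a three-way layout with one score per cell."""
    s, t, c = n_subjects - 1, n_stimuli - 1, n_conditions - 1
    return {
        "subjects": s,
        "stimuli": t,
        "conditions": c,
        "subjects x stimuli": s * t,
        "subjects x conditions": s * c,
        "stimuli x conditions": t * c,
        "subjects x stimuli x conditions": s * t * c,
        "total": n_subjects * n_stimuli * n_conditions - 1,
    }


df = degrees_of_freedom(len(SUBJECTS), len(STIMULI), len(CONDITIONS))
```

With one score per cell there is no within-cell term, so the triple interaction is the only residual left over after the main effects and two-way interactions.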







Subjects 

Ss were selected from a group of male junior and
senior medical students who had no previous experience
with hypnosis, but were willing to serve in a reaction
time experiment involving hypnosis. After training,
each S was rated hypnotizable and capable of
complete posthypnotic amnesia.2 No individuals were
accepted whose personalities were rated “extreme”
on the basis of a brief Rorschach evaluation, a
psychiatric interview and behavior during hypnotic training
sessions.


Procedure 


The same conflict story and instructions governing 
degree of awareness were related under hypnosis to 
each S. In outline, the story was this: 

In medical school, the S has had an experience so 
disturbing that he had to forget it. As it is recalled, 
he will re-experience it in detail. He was attending a 
party and volunteered to borrow a car and go for 
more beer. On his way back, he drove rapidly around a 
dark corner and felt the car hit something. He feared 
an accident but did not want to stop since he was in a 
borrowed car and had been drinking. He rationalized 
that it had probably been only a bump in the road and 
drove on. Next morning his fears were confirmed in 
a newspaper story of a child, near death and hopelessly 
crippled as the result of a hit and run accident. The 
details of the account made it clear to him that he was 
the driver for whom the police were searching. He 
experienced great panic and conflict about what to do.


Each condition of awareness was induced under 
hypnosis after review of the story. For the postconflict 
condition, the hypnotist proved to S under hypnosis 
that the incident could not have occurred and talked 
freely with him to assure removal of the conflict. After 
completion of the experiment, a final hypnotic session 
assured removal of the conflict; it was suggested also 
that the S would recall his thoughts and feelings, 
wanting to talk freely about them when he awakened.

The association test was administered before
hypnosis (Condition I), in the waking state after each
hypnotic session (Conditions II, III, and IV), and
after the conflict was removed (Condition V).

Independent ratings of each S’s behavior and
introspective accounts were made by two judges, the
hypnotist and the E. Unanimous agreement that S had
accepted the conflict and with respect to levels of
awareness was required.

The 100-word association test, composed of 60
neutral, 20 critical, and 20 postcritical words, was
individually constructed for each S. Neutral and
postcritical words were selected on the basis of S’s
responses to a preliminary test of 210 words. Words
were excluded when any measure of response deviated
markedly from his mode. Eighty words were then
drawn at random from the remaining list. The critical
words were either key words in the conflict story
(collision, party, newspaper, borrow, etc.) or suggested
feelings and ideas closely related to it (criminal, guilty,
conflict, etc.). These words were randomized and
distributed at varying intervals among the neutral
words. The same order of words was maintained
throughout the testing.3

The apparatus employed (2, 3) furnished several
response measures: verbal reaction time (VRT), time
between beginning of verbal response and beginning of
voluntary motor pressure (VR-VMP), number of
VMP made during a single stimulus-response sequence
(#VMP), duration of the first VMP in a
stimulus-response sequence (DVMP), number of
involuntary motor pressures occurring in a stimulus-response
sequence (#IMP), duration of IMP (DIMP),
breathing rate (BR), and number of conflict-associated
verbal responses obtained during a single testing period
(#CAVR). These measures were reliable, as indicated
by agreement between two scorers. Reliability
coefficients range from .84 to .997 on measured items
and percentages of agreement range from .916 to 1.00
on counted items.

2 Hypnosis was conducted by three psychiatrists,
Drs. Paul Huston, A. W. Freidinger, and F. S. Bobbitt.
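
One way to assemble a 100-word list of the kind described above is sketched below. It is an illustration under stated assumptions, not the author's procedure: the function name, the placeholder word labels, and the use of a seeded shuffle are all hypothetical. The sketch pairs each critical word with the postcritical word that follows it and scatters the pairs among the neutral words.

```python
import random


def build_word_list(critical, postcritical, neutral, seed=0):
    """Interleave words so that every critical word is followed by a postcritical word."""
    if len(critical) != len(postcritical):
        raise ValueError("need one postcritical word per critical word")
    rng = random.Random(seed)
    critical = list(critical)
    rng.shuffle(critical)  # randomize the order of the critical words
    # Each unit is either a [critical, postcritical] pair or a single neutral word.
    units = [[c, p] for c, p in zip(critical, postcritical)]
    units += [[n] for n in neutral]
    rng.shuffle(units)  # scatter the pairs at varying intervals among neutral words
    return [word for unit in units for word in unit]


# Placeholder labels standing in for real words (hypothetical, not the study's lists).
critical_words = [f"critical_{i}" for i in range(20)]
postcritical_words = [f"postcritical_{i}" for i in range(20)]
neutral_words = [f"neutral_{i}" for i in range(60)]

word_list = build_word_list(critical_words, postcritical_words, neutral_words, seed=1)
```

Reusing the same seed reproduces the same order, which mirrors the fixed word order kept throughout the testing.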


RESULTS 


The validity of the experimental situation,
apart from that of the response measures, is
assumed on the basis of unanimous judges’
ratings and on the consistency among
observations and introspective reports. Furthermore,
there appears to be descriptive similarity
between repression, as postulated here, and the
posthypnotic conditions of awareness.

Validity of response measures could be tested 
more directly. By means of analyses of vari- 
ance, the three experimental variables (sub- 
jects, stimuli, and conditions) were tested in 
interaction with one another (14). Since none 
of these tests was significant, the specific dif- 
ferences relating to validity and to the experi- 
mental hypotheses were tested by the triple 
interaction variance as the error term. 
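
The reporting convention used for these tests, given in the footnote to the validity results (.05 or below is "significant", between .05 and .20 is "suggestive"), can be sketched as a small labeling function. Two details are assumptions rather than statements from the text: the .20 boundary is treated as inclusive, and levels above .20 are labeled "not significant".

```python
def confidence_label(p):
    """Label a confidence level by the convention in the text's footnote."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie between 0 and 1")
    if p <= 0.05:
        return "significant"
    if p <= 0.20:  # inclusive upper bound is an assumption
        return "suggestive"
    return "not significant"  # label chosen for illustration


# Example levels of the kind reported in the tables (for illustration).
labels = [confidence_label(p) for p in (0.01, 0.05, 0.10, 0.30)]
```

Under this convention a table entry of "10-20%" would read as suggestive and an entry of "1%" as significant.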

Table 1 summarizes results of the validity
tests.4 It is seen that (a) all response measures,
except BR, tend to differentiate responses in
the conditions of conflict from those in
conditions of nonconflict (see Lines 1a, 1b, and 1c);
(b) all measures but DVMP tend to
differentiate between responses to critical and to
neutral words under the conflict conditions
(Line 2a). Conversely, no difference between
responses to these words is shown in the
nonconflict conditions (Line 2b). It is interesting
to note, in terms of Luria’s findings, that VRT
and VR-VMP can be considered valid with
greater confidence than any of the other
measures.

3 Maintaining the same sequence of words
throughout the experiment makes likely the operation of
practice effects in opposition to the experimental hypotheses.
Changing sequence, however, might introduce factors
less subject to appraisal.

4 Tests significant at or below the .05 level of
confidence are considered “significant”; those between the
.05 and .20 levels are considered “suggestive.” This
procedure is used to point up consistent trends. The
small number of Ss makes necessary large differences,
considerable precision and intersubject homogeneity
for a high level of statistical significance; requirements
difficult of fulfillment in an experiment of this kind.

TABLE 1
RESULTS OF VALIDITY TESTS OF ALL RESPONSE MEASURES SHOWING FOR EACH THE LEVEL OF
CONFIDENCE WITH WHICH THE HYPOTHESIS OF NO DIFFERENCE MAY BE REJECTED

                                                        Response Measure*
Difference tested                      VRT    VR-VMP  DVMP    #VMP    BR     #IMP    DIMP    CAVR
1. Between means for conflict and
   nonconflict conditions
   a. Critical words, conflict minus   1%     2-5%    5-10%   10-20%  †      5-10%   5-10%   †
      critical words, nonconflict
   b. All words, conflict minus all                                   20%
      words, nonconflict
   c. All words, conflict minus all                                                          10-20%
      words, postconflict
2. Between mean response to critical
   and to neutral stimulus words
   a. Critical words, conflict minus   1%     5-10%   20%     10-20%         10-20%  10-20%
      neutral words, conflict
   b. Critical words, nonconflict      20%    20%     20%     20%            20%     20%
      minus neutral words, nonconflict

* Response measure:
  VRT: Verbal reaction time
  VR-VMP: Time between verbal response and voluntary motor pressure
  DVMP: Duration voluntary pressure
  VMP: Voluntary motor pressure
  BR: Breathing rate
  IMP: Involuntary motor pressure
  DIMP: Duration involuntary pressure
  CAVR: Conflict associated verbal response
† Response measure could not be tested for this difference.

It is now possible to evaluate the findings in
relation to the experimental hypotheses. When
hypothetical results (Fig. 1) are compared
with obtained results for the group (Fig. 2),
general conformity is apparent. However,
maximum disturbance tends to occur more
often in Condition III (partial awareness) than
in Condition IV (full awareness), and there is
little generalization of disturbance in responses
to postcritical and neutral stimuli.

The specific experimental predictions have
been subjected to statistical test; reported in
Table 2, however, are only those tests
warranted by the validity tests (Table 1) and
study of Fig. 2. Breathing rate and
conflict-associated verbal responses are omitted as are
comparisons involving responses to postcritical
and neutral words.

1. There was little disturbance under the
condition of conflict without awareness (Table
2, Line 1). Only VRT and DIMP tend to
differentiate between these conditions. That
there is no particular disturbance in relation to
other types of stimuli is evident in Fig. 2.

2. There is increased disturbance in response 
to critical words in the condition of partial 
awareness over the condition of conflict with- 
out awareness (Table 2, Line 2) and over the 
condition of preconflict (Table 2, Line 3). 
There is also the predicted involvement of more 
response measures (Table 2, Line 3 in relation 
to Line 1), but obviously not a significant 
generalization of disturbance to postcritical 
and neutral stimuli (Fig. 2). 

3. The predicted further increase (to a 
maximum) in disturbance under the condition 
of conflict with full awareness (Table 2, Lines 
4 and 5) did not occur. Fig. 2 shows that, for 
most measures, disturbance in this condition 
(IV) is somewhere between the other two con- 
flict conditions. For all measures though, re- 
sponses to critical words are significantly dif- 
ferent in the condition of full awareness from 
those in the preconflict condition (Table 2, 
Line 6). 

4. All response measures lose their disturbed
character when conflict is removed but
awareness maintained (Table 2, Line 7). No response
measure differs significantly between
preconflict and postconflict conditions. With no data
available on conflict-associated verbal
responses in the preconflict condition, it is not
possible to evaluate the significance of the
average of four such responses in the
postconflict condition (Fig. 2).

FIG. 1. HYPOTHETICAL CURVES OF RESPONSE MEASURES FOR THE TOTAL GROUP BY EXPERIMENTAL
CONDITIONS AND TYPES OF STIMULI
(Panels: VRT, in seconds; VR-VMP, in seconds; No. VMP, in numbers; No. IMP, in numbers;
DIMP, in seconds; BR, in seconds; No. CAVR, in numbers. Horizontal axis: conditions.
Legend: critical, postcritical, neutral.)

FIG. 2. MEANS OF RESPONSE MEASURES FOR THE TOTAL GROUP BY EXPERIMENTAL CONDITIONS AND
TYPES OF STIMULI
(Panels: VRT, in seconds; VR-VMP, in seconds; DVMP, in seconds; No. VMP, in numbers;
No. IMP, in numbers; DIMP, in seconds; BR, in seconds; No. CAVR, in numbers.
Horizontal axis: conditions.)

RUTH A. BOBBITT

TABLE 2
TESTS OF THE “DEGREE OF AWARENESS” HYPOTHESIS SHOWING THE LEVEL OF CONFIDENCE WITH
WHICH THE HYPOTHESIS OF NO DIFFERENCE MAY BE REJECTED
(For each response measure, for critical stimulus words only)

                                                        Response Measure
Difference Between Means               VRT          VR-VMP       DVMP         #VMP         #IMP         DIMP
1. Preconflict minus conflict          [illegible]  [illegible]  20%          20%          20%          10-20%
   without awareness
2. Conflict without awareness          [illegible]  [illegible]  20%          10-20%       10-20%       20%
   minus conflict with partial
   awareness
3. Preconflict minus conflict          [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
   with partial awareness
4. Conflict with partial awareness     [illegible]  [illegible]  20%          20%          20%          20%
   minus conflict with full
   awareness
5. Conflict without awareness          [illegible]  [illegible]  10-20%       20%          20%          20%
   minus conflict with full
   awareness
6. Preconflict minus conflict          [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
   with full awareness
7. Preconflict minus postconflict      [illegible]  [illegible]  20%          20%          20%          20%

* This tendency is for verbal reaction time to be significantly less disturbed

DISCUSSION

Two discrepancies, one major and one minor,
between the experimental hypotheses and
obtained data have been noted. First, maximum
disturbance did not occur in Condition IV. In
this connection, it must be considered that:

1. All obtained differences between
conditions of awareness may be due to chance.
However, the consistent statistical, graphic,
and introspective results make this a remote
possibility.

2. Contrary to the judges’ evaluations and
the introspective accounts, there may have
been no real difference in awareness between
Conditions III and IV.

3. With development of full awareness, the
situation may have had less conflict-value. No
S expressed doubt that the accident occurred.
Three Ss stated that they feared inevitable
punishment and show a general downward
trend in disturbance. Two Ss had decided
against conscience and in favor of
self-protection. One of these showed greatest disturbance
in Condition IV, while the other showed
somewhat inconsistent increases and decreases
in disturbance. One S, whose behavior was
much the same in Conditions III and IV, could
see no solution to the conflict but was trying to
avoid detection.

If there was resolution of conflict in Condi- 
tion IV, the hypothesis as formulated may be 
true, but most of the test may have been ad- 
ministered after resolution began and may 
measure the process of conflict and anxiety 
reduction instead of reaction to the develop- 
ment of full awareness. 

4. The data may reflect an error in the 
formulation of the hypothesis concerning de- 
gree of awareness. The development of full 
awareness of a conflict may be accompanied 
by diminution in disturbance so that maximum 
disturbance occurs when awareness is at least 
partial but somewhat less than complete. The 
peak of disturbance may come with developing 
awareness, but almost immediately other de- 
fenses operate to reduce disturbance (anxiety). 

5. The effects of full awareness may have 
been counteracted by some habituation factor. 
Tests of practice effects, i.e., differences be- 
tween Conditions I and V, do not show sig- 
nificant results (Table 2, Line 7). Tests of 
abreaction or desensitization cannot be made 
senarate from those of the experimental hy- 
potheses. 
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The second discrepancy is that the expected 
generalization of disturbance to postcritical 
and neutral stimulus words did not materialize 
to any significant extent. The qualitative 
findings of Luria and of Huston, Shakow, and 
Erickson (13, 15) suggest such generalization, 
while the statistical findings of Gardner (11) 
do not. Generalization may depend oz: several 
factors, two of which should be mentioned here: 

1. The time interval between a verbal re- 
sponse and the succeeding verbal stimulus (in 
this study, approximately three seconds were 
allowed to elapse to permit dissipation of dis- 
turbance in the appropriate stimulus-response 
sequence). 

2. The remoteness of postcritical and neutral 
words from critical subject matter (in this 
experiment, rather unusual care was taken, in 
selecting neutral words, to avoid critical implications).


SUMMARY 


Based on the Freudian concept of repression, 
an experiment was designed to test the follow- 
ing specific hypotheses with respect to anxiety 
behavior: 

1. Repression serves to protect the individual 
from anxiety related to the repressed impulse. 

2. The less successful the repression, the 
greater is the anxiety experienced in relation to 
conflict between the repressed and the ego 
demands. 

3. Complete failure of repression results in 
maximum anxiety, a condition which persists 
until a new repression is instituted or another 
defense becomes operative. 

4. Complete relief from both repression and 
conflict results in cessation of anxiety. 

Through hypnotic induction and control of 
conflict and degree of awareness, five experi- 
mental conditions were produced: (a) preconflict
control; (b) conflict with posthypnotic
amnesia for the situation; (c) conflict with 
partial posthypnotic amnesia; (d) conflict with 
full awareness; and (e) postconflict control. 
Eight measures of response to a word-associa- 
tion test administered by the Luria method 
were obtained for six Ss. Each S was tested in 
the waking state under the five experimental 
conditions. 

For the Ss as a group it was demonstrated
that (a) there was a conflict; (b) there was a
progressive increase in degree of awareness;
and (c) most of the measures of behavior reflected
emotional disturbance with some validity.

Results were as predicted except in two 
respects: 

1. There was not the anticipated maximum 
disturbance when Ss were made fully aware of 
the conflict. It is suggested that (a) with failure 
of repression certain defenses quickly operate 
to diminish anxiety, or (b) the hypothesis requires
reformulation at this point.

2. Predicted generalization of disturbance 
to neutral stimulus material did not occur, 
although there was generalization to an in- 
creasing number of response measures. It ap- 
pears that the testing procedure allowed most 
disturbances to dissipate in the stimulus- 
response sequence in which they began. 


THE EFFECT OF SIBLING COMPETITION AND ALLIANCE ON LEVEL
OF ASPIRATION, EXPECTATION, AND PERFORMANCE¹


EVA DREIKURS FERGUSON 


Western Psychiatric Institute, University of Pittsburgh


The present study seeks to test certain
hypotheses that have arisen from the 
Adlerian theory of sibling relationships. 
According to this theory, personality patterns 
are a reflection of goals that the individual 
establishes within the family constellation of
the early years of childhood. A goal basic to all
human behavior is the wish to belong, to be
accepted (1, 2, 4), so that the child’s behavior 
patterns are expressions of trying to find a 
place, or status, within the family group. The 
presence of siblings in the family can be viewed 
by the child as an obstacle to attaining status, 
in which case the relationship he establishes 
with his siblings is a competitive one. If, how- 
ever, the sibling is viewed as an ally in the 
process of gaining status in the family, the 
interaction between these two children takes on 
a pattern different from that of the competi- 
tive relationship. Since early childhood striv- 
ings for status find expression in individual 
personality development, the personality pat- 
terns of the siblings reveal the type of rela- 
tionship established between them: allies be- 
come alike in personality, competitors become 
different. Dreikurs, in expanding the original 
conceptualization of Adler, has described as 
follows the dynamics of the sibling interaction 
and their effects on personality development: 

As each member of the family strives for his own
place within the group, the competing opponents watch
each other carefully to see the ways and means by which
the opponent succeeds or fails. Where one succeeds, the
other gives up; where one shows weakness or deficiencies,
the other steps in. In this way competition
between two members of the family is always expressed
through differences in character, temperament, interests,
and abilities. Conversely, the similarity of
characteristics always indicates alliances. Sometimes,
the two strongest competitors show no sign of open
rivalry but, rather, present a close-knit pair; nevertheless,
their competitive striving is expressed in personality
differences (p. 40, 4).


¹ This study is a condensation of the doctoral dissertation
(6) conducted in partial fulfillment for the
Ph.D. degree at the Department of Psychology, Northwestern
University. Grateful acknowledgments are
extended to Drs. Benton J. Underwood, dissertation
supervisor, and Rudolf Dreikurs, who served as interviewer
and gave invaluable suggestions.
Competition, as used here, thus describes a
specific behavior pattern: giving up where the 
other person succeeds and striving to outdo 
only where the other fails, with the result that 
competitors display contrasting personality 
patterns. Dreikurs differentiates rivalry from 
competition (4, 5), rivalry being a struggle for 
superiority in specific situations or for limited 
goals, while competition is a pervasive and 
long-range struggle over the attainment of 
status within the family group. Moreover, 
rivalry need not display the “seesaw” phe- 
nomenon postulated above for competition. 
According to this distinction, rivalry may 
occur between any two individuals, including 
allies. 

This conceptualization of competition and 
its effect on personality development has found 
validation in clinical practice, as for example 
in therapy with “problem children’’: once the 
problem child improves with therapy, his 
previously “good” competitor usually displays 
problem behavior that is entirely novel for 
him. Rigorous experimental tests of these 
formulations have not yet been conducted, 
however. The present study therefore at- 
tempted to clarify to what extent siblings who 
are Allies and those who are Competitors, as 
defined by sameness or oppositeness of traits, 
do behave in the manner described by 
Dreikurs. For this purpose, sibling pairs were
given three experimental tasks, and two types 
of response measures were utilized: (a) per- 
formance and (b) statements of aspiration. 
Two hypotheses were derived from Dreikurs’ 
position: 

Hypothesis 1. In a competitive relationship, 
a child’s performance, level of aspiration, and 
level of expectation rise when he is informed 
that his sibling has done more poorly than 
himself and drop when he is told that his 
sibling has done better than himself. 

Hypothesis 2. In an alliance relationship, a 
child’s performance, level of aspiration, and 
level of expectation drop when he is informed 
that his sibling has done more poorly than 
himself and rise when he is told his sibling has 
done better than himself. 
On the basis of data to be presented, it was
found that these hypotheses required sub- 
stantial modification. Differences did emerge 
between Competitors and Allies, but not as 
hypothesized nor did they occur on all tasks 
and response measures. Upon post hoc con- 
sideration of the task dimensions in the present 
study, it seemed necessary to take into ac- 
count two additional variables, ego-involve- 
ment and status-effect, in framing a more 
adequate test of the Adler-Dreikurs formula- 
tion. These variables are here so defined as to 
link on a common dimension ego-involvement 
and success-failure evocation, variables which 
in the literature are largely treated as discrete 
phenomena and which to the present writer 
appear conceptually related. Ego-involvement 
is here defined as occurring when S perceives 
the task or situation as important to the main- 
tenance or raising of his status, either because 
of the nature of the task or because of instructions
given to S.² Status-effect is defined as
occurring when S experiences a shift in status.
Thus, under conditions of failure S experiences
a decrease of status, and under conditions of
success S experiences an increase of status.
Presumably, ego-involvement (calling status
into play) heightens status-effect, although
operationally ego-involvement and status-effect
are differentiable. Both for the purpose
of the present study, as well as for a clarifica- 
tion of these problems in the literature, this 
kind of distinction may be found useful. 

Although ego-involvement and status-effect 
were not varied systematically in the present 
study, post hoc analysis of the experimental 
procedure and of the obtained data suggested a 
revised formulation that gives prominence to 
these two variables. This formulation is pre- 
sented at this point to facilitate the exposition 
of results along lines believed by the investiga- 
tor to be promising for further research. A 
basic hypothesis in the revised conceptualiza- 
tion is that differences between Allies and 
Competitors appear in a sibling comparison 
situation when ego-involvement and status- 
effect are present to a high or moderate degree 


² This definition of ego-involvement covers more
aspects than are usually described in the literature
(...) and as such appears to have more face
validity. It includes raising of status, rather than
merely threat to the maintenance of status, and it
defines status both in terms of self-esteem and social-esteem,
for want of a satisfyingly clear delineation between
the two types of esteem.
but not when they are minimally present.
When these variables are present to a high 
degree, as they are in much of the real-life 
experience of the child, Allies and Competitors 
may be expected to behave as predicted by 
the original hypotheses, i.e., Allies strive to do 
like the sib, Competitors give up when the sib 
does better and only strive to outdo the sib 
when the sib does worse. When ego-involve- 
ment and status-effect are moderate, however, 
Competitors behave in a manner analogous to 
rivalry while Allies are relatively unaffected by 
the sib comparisons, being more attentive to 
their own than to their sib’s performance. This 
revised hypothesis leads to several ramifica- 
tions. For one thing, it suggests that the al- 
liance relationship is a weaker one than that 
of competition, stronger ego-involvement and 
status-effect being necessary for the Allies than 
for the Competitors to show a change in a 
situation of sib comparison. It further suggests 
that the type of behavior shown by Com- 
petitors is contingent upon whether the sib 
comparisons are presented in a context of 
moderate or of high ego-involvement and 
status-effect. Finally, the revised formulation 
does not predict difference between Allies and 
Competitors when presented with sib com- 
parisons under conditions of low ego-involve- 
ment and status-effect, even though both 
groups may use knowledge of the sib’s per- 
formance as cues for their own performance 
when they lack prior experience with the 
task. Goal-setting statements, possibly tapping 
motivational variables more directly than do 
performance measures, should show response 
changes more clearly than performance meas- 
ures in the sib comparison situation. It may 
therefore be hypothesized that whereas perform- 
ance and goal-setting statements differentiate 
the behavior of Allies and Competitors under 
conditions of high ego-involvement and status- 
effect, only goal-setting statements do so under 
moderate ego-involvement and status-effect. 


METHOD 


Subjects 


Two duplicate experiments were performed. Center 
Ss came from families where at least one sib was 
brought, within the past 18 months, to a child guidance 
center as a behavior problem; Normal Ss were selected 





³ The clinics are part of the Community Child
Guidance Centers of Chicago, a nonprofit organization 
under the sponsorship of the Individual Psychology 
Association of Chicago. 
from churches in the community in which the university
is located. Each experimental group was composed of 
20 sib pairs, 10 of Allies and 10 of Competitors, giving a 
total of 80 Ss or 40 sib pairs. All Ss came from families 
that had three or more children, Ss being between 
4½ and 10 years of age. Allies and Competitors were
matched by pairs for age and where possible for sex, 
approximately half the Ss in each group being under 
eight years of age and half being eight years or older. 
The Ss were selected on the basis of information ob- 
tained in a brief interview with the mother of each pair 
of sibs. The interviewer was a prominent Adlerian 
psychiatrist who was instructed to perform the inter- 
views in the manner usual in his psychiatric practice. 
His diagnoses based on the interview data were found, 
on analysis subsequent to the completion of the study, 
to follow a quantitative system, although he was not 
aware of this and followed what he considered was an 
intuitive approach in arriving at the diagnoses.

A pre-experimental test of the reliability of the in- 
terview method was performed. In this, the interviewer 
talked to twenty sets of parents without knowledge of 
which father and mother belonged together. To assess 
degree of agreement between the diagnoses derived 
from interview with the father and from interview with 
the mother, a 3 X 3 chi-square contingency table was 
prepared for the three diagnostic categories of Allies, 
Mixed, and Competitors. This yielded a contingency 
coefficient of .640, significant beyond the .01 level. 
(This coefficient is to be compared to the maximum 
possible for a 3 X 3 table, of .816.) In the interviews, 
the examiner first asked the parent which of the 
children are most different and which most alike, and in 
what regard. He then asked the parent to describe each 
child. If more information was needed, the interviewer 
asked about the following types of traits: who has the 
most (least) temper; who minds best (least); who is the 
best (worst) student; who is the most (least) outgoing. 
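
The two figures quoted for the reliability test can be checked with a short computation. The sketch below assumes that the coefficient is Pearson's contingency coefficient, C = sqrt(chi2 / (chi2 + N)), and that N is the twenty sets of parents; the function names are illustrative only, and the .01 critical value for 4 df (13.277) is the usual tabled figure, not a value taken from the study.

```python
import math


def max_contingency_coefficient(k: int) -> float:
    """Upper limit of Pearson's C for a k x k table: sqrt((k - 1) / k)."""
    return math.sqrt((k - 1) / k)


def chi_square_from_c(c: float, n: int) -> float:
    """Invert C = sqrt(chi2 / (chi2 + n)) to recover the chi-square value."""
    return n * c ** 2 / (1 - c ** 2)


# Assumed inputs: C = .640 as reported, N = 20 parent pairs (an assumption).
c_max = max_contingency_coefficient(3)
chi2 = chi_square_from_c(0.640, 20)

# Standard tabled chi-square critical value for 4 df at the .01 level.
CRITICAL_01_DF4 = 13.277

print(f"maximum C for a 3 x 3 table: {c_max:.3f}")
print(f"implied chi-square: {chi2:.2f}")
print("beyond .01 level:", chi2 > CRITICAL_01_DF4)
```

Under these assumptions the implied chi-square falls just above the .01 critical value, which agrees with the reported significance level, and the upper limit of C for a 3 X 3 table comes out at about .816.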

For the 80 Ss used in the experiment, E did not know
the diagnosis of Ss until the completion of the study, at 
which time the interview material was analyzed. This 
analysis revealed that when sibs showed a ratio of at 
least three traits opposite to one trait alike, the diagnosis
was that of competition, and when the trait ratio
of alike to opposite was at least three to one, the diagnosis
was that of alliance.⁴ The 40 experimental pairs
were drawn from a total of 104 diagnosed pairs, of 
which 10 pairs were not usable because one or both of 
the sibs were outside the required age range. The 
diagnostic distribution of the 104 pairs was 54 pairs of 
Competitors, 30 pairs of Allies, and 20 pairs of Mixed 
diagnosis. Selection of the final sample of 40 pairs was 
dictated by the requirements of matching Allies and
Competitors on a pair basis for age and where possible, 
for sex. Table 1 presents relevant characteristics of Ss. 
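
The trait-ratio rule just described can be written as a small decision function. This is only a sketch: the name `diagnose`, its arguments, and the treatment of pairs with no scored traits are assumptions, and the system of weighting traits used in the dissertation (6) is not reproduced here.

```python
def diagnose(alike: float, opposite: float, ratio: float = 3.0) -> str:
    """Classify a sib pair from its (weighted) counts of alike and opposite traits.

    Competitors: at least `ratio` opposite traits to one alike.
    Allies:      at least `ratio` alike traits to one opposite.
    Mixed:       anything else (including, by assumption, no scored traits).
    """
    if opposite > 0 and opposite >= ratio * alike:
        return "Competitors"
    if alike > 0 and alike >= ratio * opposite:
        return "Allies"
    return "Mixed"


# Illustrative inputs only; these are not pairs from the study.
examples = [(1, 3), (3, 1), (2, 2), (0, 4)]
for alike, opposite in examples:
    print(alike, opposite, diagnose(alike, opposite))

# The reported diagnostic distribution sums to the 104 diagnosed pairs.
assert 54 + 30 + 20 == 104
```

Comparing `opposite >= ratio * alike` rather than dividing avoids a division by zero when a pair shows no traits of one kind.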


Tasks 


Two aspiration tasks and one verbal learning task
were administered. Both Ss of the sib pair came to the 
laboratory at the same time, each S being put in a 


⁴ For a fuller account of the interview method, the
testing of its reliability, and of the system of weighting
traits to arrive at the trait ratios, the reader is referred
to the original doctoral dissertation (6).
                                TABLE 1
                   CHARACTERISTICS OF SIBLING PAIRS

                              Normal Sample           Center Sample
                           Competitors   Allies    Competitors   Allies

Number of Pairs
  F₁ – F₂ᵃ                      4           3           1           5
  F₁ – M₂                       2           3           2           1
  M₁ – F₂                       2           2           3           1
  M₁ – M₂                       2           2           4           3
  Total                        10          10          10          10

Yearsᵇ
  Mean age                    7.23        7.20        7.60        7.68
  Mean age difference         3.40        3.75        2.60        2.95

Mean Ratio
  Mean ratio of similar
  to opposite traits        .67/4.20    4.08/.35   [?].07/3.47   4.13/.57


ᵃ F₁ – F₂ is to be read that both older and younger siblings are
female; F₁ – M₂, that the older sibling is female, the younger,
male; etc.

ᵇ Years are computed in half-year units.


separate room. Each task was presented alternately to 
each sib, and Ss had no contact with one another 
throughout the experiment. The only information each 
sib had of the other’s performance was that supplied by 
E. While E tested one S, the sib played with an assistant 
in another room. Time intervals between tasks were 
arranged to control for warm-up and transfer effects and 
to make credible the information given to S regarding 
the sib’s performance. To ensure Ss’ cooperation 
throughout the duration of the experiment, all Ss were 
told prior to entering the laboratory that they would 
play some games with E, and for being so nice and
coming to play the games S would get a small prize 
afterwards. So as not to interfere with the experimental 
conditions (i.e., the sib comparisons made to S) only a 
minimal amount of encouragement was given to Ss 
throughout the experiment. 

The first task presented to each S (Aspiration Task 
A) consisted of 40 pairs of dots. The E demonstrated
how the dots could be connected, and then told S that 
within one minute most children of his and his sib’s 
age could finish putting lines through the dots for 20 
pairs of dots, but that the sib was real fast (slow), 
however, and could finish 30 (or 10) pairs of dots before 
time was called. Each S was asked how many pairs of 
dots he would like to finish (Level of Aspiration, here
designated LA) and how many he thinks he can finish 
(Level of Expectation, here designated LE). Half the 
sib pairs were given the 30 comparison and half the 10 
comparison. After stating his LA and LE, S was told
that this was just a practice game and that he could 
finish it on completion of all the games. S was not 
allowed to perform on this task so that performance and 
aspiration on the neutral trials of subsequent tests 
would not be interfered with. The purpose of Aspiration 
Task A was to determine the LA and LE under the 
conditions of S’s having no previous knowledge of his 
own skill. 

Subsequent to Aspiration Task A, S was shown the 
first list of Verbal Learning Task B. This task consisted 
of three lists of pictures, the first six pictures in a list 
being presented to those Ss under eight years of age, 
all eight pictures in a list being presented to Ss eight 
years and older. The pictures were India-ink tracings of
common objects, chosen from the Stanford-Binet
Pictures Vocabulary series (Year I-IV, Form L) and
from a series used by Chapman (3). Each list of pictures
was presented to S for serial learning, the intratrial
interval being five sec. and the intertrial interval
ten sec. The two response measures obtained for
each list on this task were number of trials to learn to
one perfect trial (a perfect trial being the correct anticipation
of each picture in the list within the five-sec.
intratrial interval), and number of errors made during
the learning trials. List 1 was presented as a neutral list,
while Lists 2 and 3 were preceded by information given
to S regarding his sib’s performance. Half the sib pairs
were told before being shown List 2 that the sib had
done better than S in learning the previous list, and told
before List 3 that the sib had done worse. The reverse
sequence of sib comparisons was given to the remaining
sib pairs. When all three lists were presented under the
neutral condition in a pre-experimental test with
children aged five to ten years, the means and SDs of
the three lists were found to be comparable.

The third task, Aspiration Task C, was a pegboard
with 11 rows of 60 holes each. For each trial, S had to
insert as many pegs in a row as he could within 60 sec.,
using only one hand and picking up one peg at a time.
S was given two practice trials and nine test trials. A
trial consisted of S’s stating his LA and LE and then
filling in the holes. After three test trials, and preceding
Trials 4 through 6, half the sib pairs were told the sib
had done better than S; preceding Trials 7 through 9
these Ss were told the sib had done worse than S.
The reverse sequence of sib comparisons was given to
the remaining sib pairs. Standardization of this task
under neutral conditions with children aged five to ten
years showed performance on the test to be reliable
(the obtained Kuder-Richardson r being .972) and to
suffer from only minimal practice effect.


The twenty Ss used in standardizing the latter two
tasks were drawn from pediatric services at a large research
hospital (Illinois Research Hospital, part of the
University of Illinois medical school). All Ss, prior to
their participation in the present study, were judged by
one or more of the medical or nursing staff to be of at
least average intelligence and free of gross personality
maladjustment. They were either nonbedridden inpatient
convalescents (in a ward run as a kind of playground,
so that personality effects from hospitalization
appeared to E minimal for these Ss) or outpatients
coming for their routine pediatric checkup. From their
hospital records and their behavior in the experimental
situation, these Ss appeared to be adequately comparable
to the experimental Ss.


Procedure


In the first task, the “sib comparisons” were statements
telling S how the sib did in comparison to a
norm, while in the other two tasks the comparisons
stated how the sib did in relation to S. Although the
wording of the original hypotheses covered specifically
only the latter situation, for purposes of study the
former condition was included, and in subsequent discussion
“sib comparisons” will be used to describe both
types of comparisons. In summary, the procedure was
as follows:
1. E played 2 min. with S₁.

2. S₁ was given Aspiration Task A (sib completing
10 (30) pairs of dots).

3. S₂ was given Aspiration Task A (sib completing
10 (30) pairs of dots).

4. S₁ learned List 1 of Verbal Learning Task B
(neutral condition).

5. S₂ learned List 1 of Verbal Learning Task B
(neutral condition).

6. S₁ was given Aspiration Task C (three neutral
trials, three trials sib doing better [worse], and three
trials sib doing worse [better]).

7. S₂ was given Aspiration Task C (three neutral
trials, three trials sib doing better [worse], and three
trials sib doing worse [better]).

8. S₁ learned List 2 of Verbal Learning Task B
(sib doing worse [better]).

9. S₂ learned List 2 of Verbal Learning Task B
(sib doing worse [better]).

10. S₁ learned List 3 of Verbal Learning Task B
(sib doing better [worse]).

11. S₁ did Aspiration Task A if he wished, was
praised and given his prize.

12. S₂ learned List 3 of Verbal Learning Task B
(sib doing better [worse]).

13. S₂ did Aspiration Task A if he wished, was
praised and given his prize.


Comparison conditions were selected for each task
by the previous assignment of each sib pair to one of two
sequences: (a) the 10-Better-Worse sequence and (b)
the 30-Worse-Better sequence. In the type “a” sequence,
S was told the sib finished 10 dots on Aspiration
Task A, the sib did better on Trials 4 through 6 in
Aspiration Task C (and worse on Trials 7 through 9),
and the sib did worse on List 2 in Verbal Learning Task
B (and better on List 3). The type “b” sequence is the
complete reverse of the type “a” sequence. The sequence
of comparisons between tasks was thus balanced,
i.e., when the comparison was favorable for the sib in
the first task (sib finishing 30 dots on Aspiration Task
A), the next comparison which S received was favorable
to S (sib doing worse on Trials 4 through 6 on Aspiration
Task C), and comparison on the final task was
initially favorable again for the sib (sib doing better on
List 2 on Verbal Learning Task B). Half each of the
Center Competitors, Center Allies, Normal Competitors,
and Normal Allies were assigned to the “a”
condition and the other half to the “b” condition.
Within each group, the assignment of a given sib pair to
the “a” or “b” condition was random, the only limitation
being that no more than half of a given group
would fall into one or the other sequence condition.
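
The two comparison sequences and the restricted random assignment described above can be sketched as follows. The data layout, the names `SEQUENCES` and `assign_sequences`, and the fixed random seed are all illustrative assumptions; the text does not say how the randomization was actually carried out.

```python
import random

# Content of each sequence, following the description in the text.
# task_c: comparison before Trials 4-6, then before Trials 7-9.
# task_b: comparison before List 2, then before List 3.
SEQUENCES = {
    "a": {"task_a_dots": 10, "task_c": ("better", "worse"), "task_b": ("worse", "better")},
    "b": {"task_a_dots": 30, "task_c": ("worse", "better"), "task_b": ("better", "worse")},
}


def assign_sequences(pair_ids, rng):
    """Randomly assign half of the pairs in one group to 'a' and half to 'b'."""
    labels = ["a", "b"] * (len(pair_ids) // 2)
    if len(pair_ids) % 2:  # odd group size: the extra pair goes either way
        labels.append(rng.choice("ab"))
    rng.shuffle(labels)
    return dict(zip(pair_ids, labels))


groups = ["Center Competitors", "Center Allies", "Normal Competitors", "Normal Allies"]
rng = random.Random(0)  # seed chosen only to make the sketch repeatable
assignment = {
    group: assign_sequences([f"pair {i}" for i in range(1, 11)], rng)
    for group in groups
}

for group, pairs in assignment.items():
    n_a = sum(1 for label in pairs.values() if label == "a")
    print(group, "a:", n_a, "b:", len(pairs) - n_a)
```

Shuffling a list that already holds equal numbers of each label is one simple way to satisfy the restriction that no more than half of a group falls into either sequence.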


Response Measures 


It will be recalled that in Aspiration Task A no 
performance measures were obtained and that only one 
trial was presented to S. LA and LE raw scores are thus 
the data used for this task 

For Verbal Learning Task B no aspiration measures 
were obtained, the two performance measures being 
number of trials to learn to criterion and number of 
errors made during the learning trials. The data used 
for this task were change scores from the neutral condition,
computed by subtracting the score on the
... score on the neutral list ... improved performance (fewer ... on list ...


RESULTS


Aspiration Task A


As seen from Table 2, no significant differences
were obtained between Competitors
and Allies, although both groups had significantly
higher LE when told the sib finished
30 pairs of dots than when told the sib finished
10. Without prior experience in a task, thus,
both groups used the sib’s performance as a
reference point by which to set their own expectations.
Their aspirations did not follow
this pattern, however. The LA scores tended
to be uniformly high, except for the Center
Competitors who were told the sib finished 10
dots, the low scores of this group primarily
accounting for the significant LA interaction
found for Center Ss. The fact that this significant
interaction was not found for the
Normal Ss, but only for the Center Ss, can
possibly be explained as a function of the
counseling experience Center Ss have been
exposed to.
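
As a rough check on how F ratios of the kind given in Table 2 relate to the cell means and SDs, the sketch below recomputes the two 1-df F ratios for a balanced 2 × 2 design from summary statistics alone. It is illustrative only: the function name is hypothetical, and it assumes that the tabled SDs were computed with N rather than N − 1 in the denominator, which the text does not state. The inputs are the Center LE cells as they appear in Table 2.

```python
def f_ratios_2x2(cells, n, sd_divisor_n=True):
    """F ratios for a balanced 2 x 2 design from cell means and SDs.

    cells: {(row, col): (mean, sd)} with exactly two rows and two columns.
    n:     number of Ss per cell.
    Returns (F for the column effect, F for the interaction), each on
    1 and 4 * (n - 1) degrees of freedom.
    """
    r1, r2 = sorted({row for row, _ in cells})
    c1, c2 = sorted({col for _, col in cells})
    mean = {key: value[0] for key, value in cells.items()}

    variances = [sd ** 2 for _, sd in cells.values()]
    if sd_divisor_n:  # convert to unbiased estimates (assumption, see lead-in)
        variances = [v * n / (n - 1) for v in variances]
    ms_within = sum(variances) / 4

    # Each effect is a +1/-1 contrast on the four cell means: SS = n * L**2 / 4.
    col_contrast = (mean[r1, c1] + mean[r2, c1]) - (mean[r1, c2] + mean[r2, c2])
    int_contrast = (mean[r1, c1] - mean[r1, c2]) - (mean[r2, c1] - mean[r2, c2])
    return (n * col_contrast ** 2 / 4 / ms_within,
            n * int_contrast ** 2 / 4 / ms_within)


center_le = {
    ("Competitors", "Fast"): (31.3, 6.80),
    ("Competitors", "Slow"): (19.8, 9.17),
    ("Allies", "Fast"): (26.6, 6.86),
    ("Allies", "Slow"): (24.3, 11.72),
}
f_between, f_interaction = f_ratios_2x2(center_le, n=10)
print(f"Between A & B: F = {f_between:.2f}; Interaction: F = {f_interaction:.2f}")
```

Under the stated assumption the two values come out close to the tabled 5.44 and 2.42, which suggests, without establishing, that the tabled analysis was of this form.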


Verbal Learning Task B 


As seen from Table 3, no significant effects 
were found for this task. 


Aspiration Task C


No significant effects were found for the
performance measure, although both LA and
LE measures yielded significant differences
between Allies and Competitors. As seen from


                                         TABLE 2
                      MEAN LA AND LE SCORES ON ASPIRATION TASK A
                                    (N per cell = 10)

                                   Normal Sample                          Center Sample
                        Sib “Fast” (A)   Sib “Slow” (B)          Sib “Fast” (A)   Sib “Slow” (B)
Measure  Group           Mean    SD       Mean    SD       F      Mean    SD       Mean    SD       F

LA       Competitors     27.9   12.60     31.0    6.63            33.5    6.73     22.9   10.47
         Allies          30.0    8.94     25.5   10.59            31.0    9.16     33.5    6.87
         Between A & B                                    .04                                      1.27
         Interaction                                     1.31                                      7.40**

LE       Competitors     24.2    9.15     19.3    5.29            31.3    6.80     19.8    9.17
         Allies          24.8    7.26     17.1    5.10            26.6    6.86     24.3   11.72
         Between A & B                                   7.46**                                    5.44*
         Interaction                                      .37                                      2.42

* p < .05.
** p < .01.


                                         TABLE 3
   VERBAL LEARNING TASK B: MEAN CHANGE SCORES IN NUMBER OF TRIALS TO LEARN TO CRITERION
                                      AND IN ERRORS
                                    (N per cell = 20)

                                        Normal Sample                    Center Sample
Response                        Sib “Better”    Sib “Worse”      Sib “Better”    Sib “Worse”
Measure    Group                Mean     SD     Mean     SD      Mean     SD     Mean     SD

Trialsᵃ    Competitors          −.20    2.36    +.05    2.50     +.80    2.29   +1.25    3.00
           Allies               +.65    2.48    +.25    2.74     +.80    3.30    +.25    3.71
Errorsᵃ    Competitors         −2.10    6.49   −1.00    6.07     −.20    6.24   +1.50    8.07
           Allies              +1.35    6.99    +.05    8.34    +2.00    9.62   −1.70    9.45

ᵃ None of the obtained F ratios were significant.
                                         TABLE 4
                        MEAN CHANGE SCORES ON ASPIRATION TASK C
                                    (N per cell = 20)

[Table body not recoverable from the source. Columns: Normal Sample and Center Sample, each
with Sib “Better” and Sib “Worse” (Mean, SD). Rows: Performance and the two goal-setting
measures, each for Competitors, Allies, and (Between Competitor-Ally). Surviving entries,
cell positions unknown: +.00, +.68, +.60, +.07, +3.10, +.87.]

* p < .01.

ᵃ Both for Normal and Center LE, Bartlett’s test for homogeneity of variance gave χ² significant beyond the .02 level.


Table 4, the Allies lowered and the Competi- 
tors raised their LA and LE under both the 
Better and Worse comparison conditions. Since 
heterogeneity of variance was found for the 
LE measure, a chi-square test suggested by 
Wilson (9) was used for this measure. This 
yielded a significant difference in LE between
Competitors and Allies for only the Center Ss.
Since the results were substantially in the 
same direction for both Center and Normal 
Ss, however, the conclusion is warranted that 
both LA and LE differentiated between the 
responses of Competitors and Allies when Ss 
were presented with sib comparisons. 

When the data of Table 4 are further ana- 
lyzed, the overall rise in score from the neutral 
condition seen in the Competitors’ LA and LE 
data turns out to obscure a more complex 
relationship. As shown in Table 5, for Com- 
petitors a significant interaction is obtained 
between Better-Worse and First-Second pres- 
entation on both the LA and LE measures. 
This interaction describes the fact that the 
two conditions, of S first being told his sib did
better and of S secondly being told his sib did 
worse, produce a significantly greater rise in 
LA and LE than do the two conditions of S 
first being told the sib did worse and of S 
secondly being told his sib did better. This 


⁵ Since Ss were randomly assigned to the sequence
conditions and since the pattern is similar for Center 
and Normal Ss, a selective S variable to account for the 
significant interaction (i.e., that Ss in the Better First- 


interaction pattern did not appear for the 
Allies. The presence of this pattern for Com- 
petitors suggests that the behavior of the 
Competitors may be characterized as largely 
that of rivalry: when Competitors hear that 
the sib does better, a set to outdo the sib is 
elicited and maintained even when subse- 
quently the sib does worse, this set to outdo 
the sib not being appreciably manifested
when Ss first are told the sib does worse. 

The meaning of the Allies’ drop in LA and 
LE scores on being presented with the sib 
comparisons can probably be best understood 
by inspection of the data of Ss used to stand- 
ardize this task. As will be recalled, these Ss 
simply took their nine test trials in the manner 
of the neutral trials of the experimental Ss, 
i.e., no comparisons being presented. These 
standardization Ss dropped their LA and LE 
scores from the first three trials to each set of 
subsequent three trials. In other words, merely 
performing the task lowers the LA and LE in Ss 
of this age, the goal-setting statements be- 
coming closer to the actual performance as a 
function of increased experience in the task. 
As seen in Table 6, Allies are comparable to 
the standardization Ss in their drop of LA and 
LE from the first three trials to the subsequent 
sets of three trials, the introduction of the sib 


Worse Second sequence are somehow different from Ss 
in the Worse First-Better Second sequence) does not 
appear to be tenable. 
comparisons after the first three trials producing
behavior in the Allies that is substantially
no different from the behavior of Ss who were 
given no comparisons of any kind. This picture 
is not found for the Competitors, whose change 
in LA and LE scores from the first three trials 
differs significantly from the change scores 
obtained for the standardization Ss. From this 
analysis it can be reasonably concluded that 
Allies, in contrast to the Competitors, were 
largely unaffected by the sib comparisons, 
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paying heed primarily to their own perform- 
ance in a manner analogous to Ss who never 
were presented with any comparisons. The 
Allies’ lack of apparent concern over the sib’s 
performance suggests that the ally relationship 
may be weaker than that of competition, 
stronger cues being necessary to elicit response 
change in a sib comparison situation. This 
suggestion, although not specified in the 
original hypotheses of the present study, is 
consonant with the Adlerian theory. 
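
The change scores discussed above can be illustrated with a short sketch. It assumes that each S contributes nine LA (or LE) values in trial order and that a change score is the mean of a later set of three trials minus the mean of the first three; the source does not say whether means or sums were used, and the function name `change_scores` and the sample values are hypothetical.

```python
from statistics import mean


def change_scores(trials):
    """Return (second set minus first set, third set minus first set).

    `trials` holds nine scores in trial order; each set is three trials.
    Using set means (rather than sums) is an assumption of this sketch.
    """
    if len(trials) != 9:
        raise ValueError("expected nine test trials")
    baseline = mean(trials[0:3])
    return mean(trials[3:6]) - baseline, mean(trials[6:9]) - baseline


# Hypothetical LA statements for one S (not data from the study).
la_statements = [30, 29, 31, 28, 27, 29, 26, 27, 25]
second_minus_first, third_minus_first = change_scores(la_statements)
print(second_minus_first, third_minus_first)
```

A negative value corresponds to the drop described for the standardization Ss and the Allies; a positive value corresponds to the rise shown by the Competitors.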


TABLE 5
SEQUENCE EFFECTS FOR COMPETITORS IN ASPIRATION TASK C
(Scores are mean change scores; N per cell = 10)

                         First Comparison Condition            Second Comparison Condition
                     Sib “Better”    Sib “Worse”              Sib “Better”    Sib “Worse”
Response               First (A)      First (B)                Second (A)     Second (B)
Measure  Sample       Mean    SD     Mean    SD       F       Mean    SD     Mean    SD       F
LE (a)   Center      +1.17   3.58    -.20   2.78              -.63   2.88   +2.50   4.25
         Normal      +1.17   3.87   -1.17   4.54              +.03   3.76   +3.27   4.77
         Between A & B                              1.69                                     7.35**
LA (b)   Center      +3.93   7.56   -1.50   9.53             +2.10   3.22   +2.30   5.15
         Normal      +6.17   9.69    +.20   4.49              +.03   6.37   +4.77   9.26
         Between A & B                              5.55*                                    4.28*

* p < .05
** p < .01
(a) When Center and Normal scores were combined, F for interaction of Better-Worse X First-Second Presentation = 7.59 (for 1,72 df, p < .01).
(b) When Center and Normal scores were combined, F for interaction of Better-Worse X First-Second Presentation = 5.64 (p < .05). Although variances were heterogeneous in the F test for interaction, a t test which compared Better First-Worse Second scores with Better Second-Worse First scores (Center and Normal combined for the t test) upheld the significant interaction even though variances were homogeneous.


TABLE 6
MEAN CHANGE SCORES IN LA AND LE ON ASPIRATION TASK C AS A FUNCTION OF TRIAL SEQUENCE (a)

                                   Second Set of Three Trials    Third Set of Three Trials
                                   Minus First Three Trials      Minus First Three Trials
Response
Measure  Group                       Mean      SD                  Mean      SD         F
LA       Competitors                +2.20     8.64                +2.30     6.61
         Allies                     -1.98     6.93                -2.04     8.92
         Standardization            -2.37     8.24                -3.89    10.25
         Between Competitors &
           Standardization                                                             8.39**
LE       Competitors                 +.24     3.81                +1.29     4.30
         Allies                      -.84     7.00                -1.69     9.50
         Standardization            -2.28    11.06                -3.29    11.25
         Between Competitors &
           Standardization                                                             [?].16*

* p < .05. Although variances are heterogeneous, the similarity of trend between LA and LE measures suggests the generality of the Competitor-Standardization difference.
** p < .01
(a) For the experimental Ss, Better and Worse conditions and Center and Normal Ss are combined for each cell. N per cell thus = 40 for each of the experimental Ss. N for each cell of the Standardization Ss = 20.
TABLE 7
MEAN SCORES ON FIRST THREE (NEUTRAL) TEST TRIALS IN ASPIRATION TASK C
(N per cell = 20)

Group                    LE Mean
Competitors
  Normal                  27.97
  Center                  26.63
Allies
  Normal                  28.55
  Center                  26.65
Standardization           29.12

[The Performance (Mean, SD) columns and the remaining columns of this table are not recoverable from the source; the LE means are listed in the row order in which they survive.]

Since Allies and Competitors differ in
their LA and LE once the sib comparisons are
presented to S, it could be argued that the
groups simply differ in their general aspiration
behavior, the sib comparisons being of
incidental importance. Inspection of Table 7,
however, reveals that Competitors and Allies
are comparable in LA, LE, and performance
on the neutral trials (and are similar to the
standardization Ss), thus ruling out any basic
difference between the two groups in general
goal-setting behavior.

DISCUSSION

The data failed to confirm the original
hypotheses of the study. It will be recalled
that a revised formulation, based on the
original Adler-Dreikurs conceptualization, was
suggested as a result of post hoc analysis of the
data and task dimensions of the present study.
This formulation considered ego-involvement
and status-effect as crucial variables in an
experimental test of the original hypotheses.
In the present investigation, the attempt was
made to limit all status manipulation to
merely providing S with information of the
sib’s performance, all other sources of
ego-involvement and status-effect being
minimized.⁶ However, on the basis of the two
variables of ego-involvement and status-effect,
it is possible to distinguish Aspiration Task A
from Aspiration Task C: Aspiration Task A
only elicits ego-involvement and not
status-effect since the information given to S
compares the sib to a norm and not to S’s own
performance; in Aspiration Task C, in
contrast, the attempt is made to evoke
status-effect by telling S the sib did better or worse in
comparison to S’s own performance, the
comparisons either deflating or inflating S’s status.
In addition, Aspiration Task C can be assumed
to be more ego-involving, if only because Ss
actually entered the task.

6 Although the Adlerian theory of sibling relations
describes the relationships of Alliance and Competition
as developing as a function of the child’s striving for
status (4), it was not recognized until the completion of
the present study that manipulation of the status
dimensions was an essential component to testing the
theory. Thus, for experimental rigor, the elicitation of
status variables was explicitly limited to merely
providing S with information about the sib’s
performance, so that censure and praise, emphasis of
success and failure, etc., were purposely minimized so
as not to confound the effects of the sib comparisons.
Moreover, neutral tasks were chosen both for the
purpose of not inducing strong success or failure
experiences other than those associated with the sib
comparisons, and also so that prior experience with a task
would not confound the results.

The revised formulation takes into account
this distinction between the two tasks. It
postulates that under conditions of moderate
ego-involvement and status-effect (as in
Aspiration Task C) Competitors in their
goal-setting behave in a manner analogous to
rivalry, while Allies are relatively unaffected
by the sib comparisons, paying heed primarily
to their own rather than the sib’s performance.
The latter is taken as indicating a weaker
relationship for alliance than competition.
The revised formulation further postulates
that under low ego-involvement and
status-effect (as in Aspiration Task A) the two groups
do not differ significantly, although both
groups may use information of the sib’s
performance as a cue for their own strivings if
they do not have their own performance
available as a cue. The fact that Verbal
Learning Task B failed to show significant differences
between Allies and Competitors is interpreted
as in line with the lack of significant effects
found in the performance measure on
Aspiration Task C. One explanation for the lack of
significant effects in the performance measures
is that goal-setting possibly taps motivational
variables more directly than do performance
measures, so that under conditions of moderate
ego-involvement and status-effect (as in Tasks
B and C) performance measures fail to
distinguish between the behavior of the two groups.
Under a high degree of ego-involvement and
status-effect, however, significant differences in
performance should be obtained, as stated in
the original hypotheses. These hypotheses,
drawn from the real-life experience of the child,
which presumably contains a high amount of
ego-involvement and status-effect, cannot be
said to have been tested in the present study.

The revised formulation awaits verification
from future studies in which ego-involvement
and status-effect are systematically
manipulated. The various roles of ego-involving
instructions and tasks, as well as of status-effect
evoked by censure-praise, failure-success
defined by means other than the sib comparisons,
etc., can then be judged in their importance to
the response changes of the two groups. Allies
and Competitors who are compared to a
stranger introduced for the purpose of the
experiment might also profitably be included
in further research. According to Adlerian
theory, Allies and Competitors would not differ
from one another when compared to a stranger
as when compared to the sib, for it is postulated
that Allies and Competitors do not intrinsically
differ in fundamental personality patterns but
that they differ in behavior only in response to
the sib with whom the particular relationship
is established.⁷ Data from the present study
indicate that the two groups were actually
closely comparable,⁸ suggesting that the change
in goal-setting noted in Aspiration Task C
can be attributed with fair certainty to the
fact that the sib was the compared object.

7 That is to say that although a given pair of Allies
will be alike in personality and a given pair of
Competitors opposite in personality, a random group of
Allies will show the same range in personality traits as a
random group of Competitors. Thus, out of four Allies,
one pair may be striving and dominant and another
pair retiring and submissive, while in each of two pairs
of Competitors one sib may be striving and
dominant and the other sib retiring and submissive. The result
will be that two out of the four Allies will be like two out
of the four Competitors, and the other two Allies will
be like the other two Competitors. In this example, it
would be reasonable to expect that the four dominant
Ss will act differently toward a stranger than the four
submissive Ss. Such data would therefore contrast to
that of the present study in which the sib was the
compared object, in that responses to a stranger would not
differentiate between the behavior of Allies and
Competitors.

8 Evidence for the comparability of the two groups
derives from the following: (a) the two groups had
comparable scores on the neutral trials of Aspiration Task
C; (b) the interview data revealed the personalities of
the two groups to be quite similar; (c) many of the
Allies had sibs with whom they had a competitor
relationship while several of the Competitors had sibs
with whom they had an alliance relationship.

In conclusion, it must be conceded that the
present study leaves several unanswered
questions. But the fact that siblings, selected
solely on the basis of their likeness or
oppositeness of personality traits, differ significantly
in goal-setting, when the sib’s performance is
used as the only source of status-effect,
suggests the fruitfulness of further experimental
investigation of the Adlerian theory of sibling
relations.


SUMMARY


The present study set out to test hypotheses
derived from the Adlerian theory of sibling
relations. Two duplicate experiments were
performed, 20 sibling pairs being drawn from
Child Guidance Centers and 20 pairs from a
“normal” population. Each group contained
10 pairs of Allies and 10 pairs of Competitors,
the operational definition of alliance being
sameness of traits of the siblings and
competition being oppositeness of traits. Diagnosis of
the sibling relations was based on interview
material obtained from the mother of each
pair of sibs. Each S was tested singly and was
given two aspiration tasks and one verbal
learning task. On each task, S was given
predetermined information about the sib’s
performance. Although the hypotheses tested
by the study were not upheld, Allies and
Competitors were found to differ significantly
in their goal-setting when the sib’s performance
was compared to S’s own performance. A
formulation that revised the original
hypotheses of the study was suggested, with the
variables of ego-involvement and status-effect
being considered crucial to an adequate test of
the original hypotheses. Although the present
study leaves several unanswered questions,
the significant results that were obtained
suggest the fruitfulness of further experimental
investigation of the Adlerian theory of sibling
relations.
EFFECTS OF PUBLIC AND PRIVATE FAILURE ON
SELF-EVALUATION¹


EZRA STOTLAND² AND ALVIN ZANDER


Research Center for Group Dynamics, University of Michigan 


A PERSON who has failed may as a result
lower his evaluation of himself, under
certain conditions, yet under other
conditions he may not pay much attention to 
the failure. Among the circumstances that may 
affect the nature of his reaction are his opinions 
about how others evaluate his performance, 
and these opinions are determined by whether 
the others know about the failure. 

It is evident that a person who has failed 
tends to become sensitive to the opinions of 
others. This has been shown in several em- 
pirical studies (2, 3, 4). It is known, too, that a 
person’s opinion about himself is affected by 
the evaluations of his performance that he at- 
tributes to others (4). If the failer assumes that 
others think poorly of his performance, his 
feelings about himself are likely to depend upon 
how much he attributes validity to the others’ 
judgments about him. 


In this study we seek to answer three
questions. Two of these are: Will a person’s
evaluation of himself when he fails in complete
privacy be different from his evaluation when
he is aware that another knows about the
quality of his performance? Will a person’s
evaluation of himself in the presence of an
incompetent judge of his ability be different
from his rating when the other person is a
competent judge?

In attempting to answer these questions, we 
take it as given that a person at all times wishes 
to evaluate himself as highly as it is possible 
for him to do, within the limits set by the facts 
as he sees them. When an individual fails on a 
task and clearly knows that he has done so, he 
cannot deny his failure to himself without 
grossly distorting reality. Thus, he is not 
likely to deny failure in such an instance. But 
where different degrees of failure are possible 
and the degree of his failure is uncertain in 
his own mind, he can deny that he has failed 
at all, or minimize the degree of the failure. 

1 The work on this study was financed by a grant- 
in-aid (M-701 (C)) from the National Institute of 
Mental Health of the National Institutes of Health, 
Public Health Service. 

2 Now at the University of Washington.


A person who has failed has no reason to
believe that other individuals who are aware
of his failure, especially strangers, minimize
the extent of that failure. Thus, we assume that
a person who has failed believes that others
will tend to evaluate his performance as worse
than he would like to believe it was.

If a failer, then, conforms to the evaluations 
he assumes another has of him, he will evaluate 
his performance low. What will lead him to 
accept a stranger’s negative evaluations? First, 
it is necessary that the other know about the 
failure, since unless the other is aware of the 
poor performance, the failer has few constraints 
against minimizing his failure to himself. In 
addition, for him to conform to the other’s 
negative evaluation, the other must generate 
social pressures on the one who has failed, 
pressures which are strong enough to overcome 
the failer’s tendency to set his self-evaluation 
as high as possible. It is assumed that stronger 
social pressures are exerted by persons who are 
experts than by those who are nonexperts. 

Therefore, it is first hypothesized that 1(a) a
person who has failed on a task evaluates the
degree of his failure as being greater when his
failure is observed by an expert than when it
is not observed by an expert; 1(b) the
observation or nonobservation by a nonexpert,
however, does not generate variations in
self-evaluations, but 1(c) the person evaluates his
performance as poorer when his failure is
observed by an expert than when it is observed
by a nonexpert.

Thus far, the individual has been considered 
as evaluating only the quality of his perform- 
ance on a task, as good or poor. He may in 
addition make evaluations of personal abilities 
that were involved in the failure. Because his 
evaluation of his abilities is likely to be af- 
fected by the evaluation of his performance, 
his rating of the abilities will probably be simi- 
lar to his evaluation of his performance. The 
three hypotheses described above, therefore, 
apply equally well to evaluations of abilities 
involved in the person’s performance on the 
task.

We conceive abilities as being either closely
related or less closely related to the ones under
test. Further, we assume that the abilities
most closely related to those being employed
at the time of his failure will be affected by
the level of that evaluation, while those less
closely related will be less affected by the
evaluation. We raise then a third question:
How much of the person’s self is included in
his evaluations of himself after an experience
of failure? A second hypothesis is proposed:
Where a person gives a low evaluation of his
performance, because of observation of his
failure by an expert, the evaluation of closely
related abilities will also be low while the
evaluation of less closely related abilities will
not be as low.

METHOD


These hypotheses were tested in a laboratory
experiment. The Ss were 126 male
undergraduate volunteers. As each arrived in the
laboratory he was privately given a puzzle
which was described as a test of visual motor
coordination, with completion of the puzzle
as the criterion of success. Before the Ss
worked on it, E introduced himself to half of
them as an expert on such tests; to the other
half as a nonexpert. The Ss worked in privacy
on the task, which was impossible to
complete. Then, in half the cases, E entered the
Ss’ rooms and told them that he observed
they had failed (public condition); in the
other half, he asked the Ss, while outside their
rooms, to put the test material away without
leaving any indication of how well they had
done (private condition). All the participants
then privately answered a questionnaire in
which they evaluated their performances on
the task, their visual motor coordination, and
their abilities in areas varying in relevance to
the task. The separate phases of the
experiment are now described.


Experimental Task 

The Ss arrived individually in the laboratory and
E escorted them immediately to private cubicles
where they remained for the rest of the experiment.
Each S was told he was to work on a puzzle to help
him discover things about himself so that he could
perform better in the later or “real” parts of the
experiment, although he was not told what these “real”
parts would be. He was told that working on these
tasks would teach him something useful about himself.
The purposes of these instructions were to arouse the
Ss’ interest in self-evaluation, to motivate them to
work on the puzzle, and to interest them in answering
the postexperimental questionnaire for their own
benefit, not as a communication to a research worker.
In this way, it was hoped to obtain more honest and
valid answers.

The first task was a puzzle which was presented as
“an excellent test of visual-motor coordination.”
Visual-motor coordination was defined for the S as
the “ability to imagine how certain shapes and forms
fit together and then to take this arrangement and
put it down in actuality.” Thus, the test was made
relevant to certain specified abilities. It was assumed
that visual-motor coordination was an aspect of
themselves on which these persons did not have fixed
evaluations and was, therefore, subject to experimental
influence. The puzzle consisted of 12 irregular
cardboard pieces with which S was to fill the space inside
an irregular outline traced on a board. He was allotted
ten minutes. The criterion of complete success was
said to be full completion of the puzzle. S was told
that degrees of failure were determined by the
discrepancy between the achieved arrangement and the
one possible way of completing the puzzle. The number
of pieces employed in the design was described as
irrelevant to his degree of failure. In this way the
objective degree of failure was made ambiguous for the
S, allowing social factors more influence on his
evaluation of his performance.

Before S was allowed to work on the puzzle, E
showed him his second task, the postexperimental
questionnaire. This was left in the cubicle for him to
fill out after the time allotted for work on the puzzle
had been spent. The private nature of his answers was
emphasized again by telling him that answers were
not of interest to the E and by repeating some of the
earlier instructions.


Expertness Induction


Before S worked on the puzzle, E introduced
himself in half the cases as Dr. X who had much experience
in administering and constructing tests of visual
motor coordination (expert condition). In the other
half, E introduced himself as a substitute for the
regular experimenter and as one who knew little about
these tests (nonexpert condition).


Public-Private Induction 


The S was asked to work privately on the task. The
E announced that the allotted time had elapsed while
standing outside the S’s cubicle. In the public condition,
E entered the cubicle and said impersonally, “I see
that you were not able to complete the task. I’m afraid
this would have to be called a failure.” He then
instructed S to put the pieces of the puzzle back into
the envelope in which he had found them and to go on
to the questionnaire. In the private condition, E did
not enter the cubicle but instead instructed S to put
the pieces back into the envelope in which he had
found them and to go on with the questionnaire. It
should be remembered that in both conditions the
Ss knew they had failed to some degree without E’s
saying so, since they had been told that the criterion
of full success was completion of the puzzle.





PUBLIC VERSUS PRIVATE FAILURE 


Thus a four-cell 2 x 2 design was employed with
the following conditions: public-expert,
public-nonexpert, private-expert, and private-nonexpert.
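
The four cells of the design can be enumerated mechanically, as in the sketch below. The factor labels follow the text; the rotation used in `assign` is purely hypothetical, since the source does not describe how Ss were allocated to conditions.

```python
from itertools import product

OBSERVATION = ("public", "private")      # whether E observes the failure
EXPERIMENTER = ("expert", "nonexpert")   # how E introduces himself


def design_cells():
    """Return the four cell labels of the 2 x 2 design."""
    return [f"{obs}-{exp}" for obs, exp in product(OBSERVATION, EXPERIMENTER)]


def assign(subject_ids):
    """Rotate subjects through the cells (a hypothetical allocation rule)."""
    cells = design_cells()
    return {sid: cells[i % len(cells)] for i, sid in enumerate(subject_ids)}


print(design_cells())
print(assign(["S1", "S2", "S3", "S4", "S5"]))
```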


Post-experimental Questionnaire


To develop a series of items about self-evaluation
[...] degrees of similarity to the abilities
[...] experimental task, a list was obtained
[...] (unpublished) of the traits that are
[...] by male undergraduates in [...].
This list was ranked a priori by [...]
in respect to their degree of similarity
[to the abilities] needed for the experimental task. To
[...] a priori ranking so that it corresponded to
[the ranking] that would be made by an average male
[undergraduate, a] modification of a method used by
[...] was employed, using male undergraduate
[...] the experiment.

[...] on the questionnaire asked S to rate
[...] if he felt that he had [...].
[...] 11-point scale [...] puzzle. [...] second
question[...], the S answering one but not
[...] had failed. In this way his evaluation
[of his perform]ance on the puzzle was measured.
[...] evaluation of his visual-motor coordination
([...] as VMC) and of a series of
[...] order of decreasing similarity
[to the abilities required] in the experimental task.
[...] the results, this 13-item scale was
divided [into groups] of 4, 4, and 5 items each and
the mean evaluation attributed to each
group was obtained. These means are referred to
hereafter as M1, M2, and M3, the subscripts
designating the order of decreasing similarity
to the abilities required in the experimental task.

The ratings given to VMC and M1 are assumed to
indicate evaluations of abilities that are closely related
to the ones used in performing the task, while
evaluations of M2 and M3 are assumed to concern areas
[less closely] related to the task-required abilities. In addition
to these 13 items a final question requested that the
S evaluate himself as a “complete person.”

[Final]ly, they were asked to rate the competence
[of E] on an 11-point scale. This question was
inserted in order to determine the extent to which Ss
in these [...] be an expert or a nonexpert.
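
The grouping of the 13 ability items into M1, M2, and M3 can be sketched as follows. The sketch assumes that the items are already ordered by decreasing similarity to the task and are split into consecutive groups of 4, 4, and 5, as the text indicates; the ratings shown are hypothetical, and the name `ability_group_means` is introduced only for illustration.

```python
from statistics import mean

GROUP_SIZES = (4, 4, 5)  # items in M1, M2, M3, in order of decreasing similarity


def ability_group_means(ratings):
    """Return (M1, M2, M3): mean rating within each consecutive item group."""
    if len(ratings) != sum(GROUP_SIZES):
        raise ValueError("expected 13 ability ratings")
    means, start = [], 0
    for size in GROUP_SIZES:
        means.append(mean(ratings[start:start + size]))
        start += size
    return tuple(means)


# Hypothetical ratings for one S, ordered from most to least task-similar item.
ratings = [4, 5, 4, 5, 6, 6, 7, 5, 7, 7, 6, 8, 7]
m1, m2, m3 = ability_group_means(ratings)
print(m1, m2, m3)
```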


RESULTS 


To determine whether the E was perceived
by the Ss as being either expert or nonexpert,
the mean ratings of the competence they
attributed to E in the post questionnaires were
computed. The mean for the expert condition
was 7.19, and for the nonexpert condition 5.56,
significantly different at the .001 level of
probability (t = 3.62). It is evident that the
Ss on the whole accepted E’s descriptions of
his expertness in the separate conditions; but
those who were told that the E was not an
expert attributed more competence to him
than might be expected.


TABLE 1
MEAN SELF-EVALUATIONS FOR VARIOUS
EXPERIMENTAL CONDITIONS

[Table body not recoverable from the source. Surviving column headings: Private Expert (N = 34, Col. c); Private Nonexpert (N = 28, Col. d); p value of diffs.* Surviving row labels: Performance on puzzle; Visual motor coordination; Abilities M1; Abilities M2.]

* By two-tail t tests


The mean self-evaluations for their
performances on the puzzle and for their separate
abilities are contained in Table 1. These data
provide no support for hypotheses 1(a), 1(b),
or 1(c) in respect to evaluation of performance
on the puzzle. In fact, contrary to what was
anticipated in hypothesis 1(b), the Ss who
were observed by a nonexpert evaluated their
performances more poorly than those who
were not observed by him, the difference being
significant at the .001 level.

The evaluation given to their visual-motor
coordination (VMC) is lower in the
public-expert condition than in the private-expert
condition (p of diff. = .06), as would be
expected by hypothesis 1(a), and observation or
nonobservation by a nonexpert generated no
difference in evaluation of VMC, as would be
expected by hypothesis 1(b). These data
cannot be accepted as support for hypothesis
1(a) or for 1(b), however, since it is apparent
that some of these Ss did not behave as was
predicted in evaluating their performances,
and the ratings of VMC depend upon a
generalization of the evaluation of their
performances to their visual-motor coordination as an
ability. The evaluations assigned to M1
abilities were approximately the same in all
conditions. It is apparent, then, that the data
in Table 1 provide no convincing
corroboration for hypotheses 1(a), 1(b), or 1(c).

In order to control for the effects of the
evaluations given to their performances on the
puzzle in determining the evaluations of VMC
and M1 abilities, the total sample was divided
into two groups at the median of their
evaluations of their performances on the puzzle.
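
A median split of this kind can be sketched as below. The treatment of Ss whose evaluation falls exactly at the median is an assumption of the sketch (they are set aside in a third group); the source does not say how such cases were handled, and the subject labels and scores are hypothetical.

```python
from statistics import median


def median_split(evaluations):
    """Split subjects into high, low, and at-median groups.

    `evaluations` maps a subject label to his evaluation of his
    performance on the puzzle. Setting aside scores equal to the
    median is an assumption, not a procedure stated in the source.
    """
    cut = median(evaluations.values())
    high = {s for s, v in evaluations.items() if v > cut}
    low = {s for s, v in evaluations.items() if v < cut}
    at_median = {s for s, v in evaluations.items() if v == cut}
    return high, low, at_median


# Hypothetical performance evaluations on an 11-point scale.
scores = {"S1": 2, "S2": 3, "S3": 5, "S4": 7, "S5": 5}
high, low, at_median = median_split(scores)
print(sorted(high), sorted(low), sorted(at_median))
```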
TABLE 2
MEAN SELF-EVALUATIONS FOR VARIOUS EXPERIMENTAL CONDITIONS FOR SUBJECTS WITH HIGH AND
LOW EVALUATIONS OF PERFORMANCE ON PUZZLE

                                           Experimental Conditions
                              Public    Public     Private   Private
                              Expert    Nonexpert  Expert    Nonexpert  Total    p value of
Item Evaluated                Col. a    Col. b     Col. c    Col. d     Col. e   diffs.(a)

Part A. Subjects with high evaluation of performance on puzzle
                              N = 11    N = 12     N = 18    N = 16
Visual motor coordination      3.64      5.15       5.17      5.00       4.83    a-c .03
                                                                                 a-b .03
Abilities M1                   5.03      5.87       5.97      5.83       5.73    a-b .05
                                                                                 a-c .05
Abilities M2                   6.60      7.03       6.87      6.86       6.85
Abilities M3                   6.15      6.68       6.63      6.00       6.26
Complete person                6.18      7.09       6.72      6.44       6.61

Part B. Subjects with low evaluation of performance on puzzle
                              N = 13    N = 19     N = 9     N = 9
Visual motor coordination      4.92      3.42       3.89      3.00       3.56
Abilities M1                   5.15      4.49       4.80      4.28       4.68
Abilities M2                   7.13      6.14       6.44      6.44       6.51    a-b .001
Abilities M3                   6.52      5.97       6.02      5.96       6.12
Complete person                6.08      6.11       6.56      5.22       6.20

Note. Value of p for differences between comparable cells in Parts A and B of Table 2: VMC, Col. b, .03; VMC, Col. d, .03; VMC, Col. e, .001; M1, Col. b, .001; M1, Col. c, .05; M1, Col. d, .03; M1, Col. e, .001.

(a) By two-tail t tests
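
The Total column of Table 2 can be checked against the cell means, as in the sketch below. It assumes that each Total is the N-weighted mean of the four condition means, which the source does not state; the tolerance and the names `weighted_mean` and `check` are likewise choices made for this illustration. The values are those printed in Table 2 for the VMC and M1 rows.

```python
def weighted_mean(means, ns):
    """Return the N-weighted mean of a set of cell means."""
    return sum(m * n for m, n in zip(means, ns)) / sum(ns)


# Cell Ns as printed in Table 2 (Cols. a-d).
N_PART_A = (11, 12, 18, 16)
N_PART_B = (13, 19, 9, 9)

# (part, item, cell means for Cols. a-d, printed Total in Col. e)
ROWS = [
    ("A", "VMC", (3.64, 5.15, 5.17, 5.00), 4.83),
    ("A", "M1", (5.03, 5.87, 5.97, 5.83), 5.73),
    ("B", "VMC", (4.92, 3.42, 3.89, 3.00), 3.56),
    ("B", "M1", (5.15, 4.49, 4.80, 4.28), 4.68),
]


def check(rows=ROWS, tolerance=0.015):
    """Compare each recomputed weighted mean with the printed Total."""
    results = []
    for part, item, means, printed in rows:
        ns = N_PART_A if part == "A" else N_PART_B
        recomputed = weighted_mean(means, ns)
        agrees = abs(recomputed - printed) <= tolerance
        results.append((part, item, round(recomputed, 2), printed, agrees))
    return results


for row in check():
    print(row)
```

Under this assumption three of the four rows reproduce the printed Total to within rounding, while the Part B VMC row does not; a Col. a value of 3.92 rather than 4.92 would reproduce it, which may point to a misread digit in this copy, though that remains a conjecture.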
Then, the means of the evaluations given to
their separate abilities were calculated. These
are reported in Table 2.

It can be seen there that those Ss who
minimized their failures and gave better
evaluations to their performances (Part A), tended
in the large majority of cases to give higher
evaluations to their abilities than did Ss (Part
B) who rated their performances as poorer.
There is a strong tendency, it is clear, for
persons who think better of their performances
to also think better of their abilities. This
tendency is evident in every experimental
condition, except the public-expert one.

In the public-expert condition, the Ss who
had rated their performances high evaluated
their VMC and M1 abilities as low as did the
Ss who had rated their performances low. We
return later to a discussion of this discrepancy
in the evaluations of performances and
abilities in the public-expert condition.

The low evaluations of the VMC and M1
abilities by Ss in the public-expert condition,
but not in the other conditions, provide data
relevant to the first hypothesis.

The (a) part of Hypothesis 1 asserts that
persons make lower evaluations of themselves
when their failures are observed by an expert
than when not observed. It is clear that VMC
and M1 abilities are rated significantly lower
by Ss who were in the public-expert condition
than by those who were in the private-expert
condition. This is true, however, only
for those who evaluated highly their
performances on the puzzle.


The (b) part of Hypothesis 1 states that
where the E is a nonexpert, persons do not
evaluate themselves differently when their
failure is observed as compared to when it is
not observed. The hypothesis is supported
both by those who rated their performance
high, and by those who rated their
performance low.

The (c) part of Hypothesis 1 says that ob- 
servation of the Ss’ failures by an expert 
generates lower self-evaluations than does 
observation by a nonexpert. It is evident that 
this hypothesis is supported by the responses 
of those Ss who gave themselves high evalua- 
tions in their performance on the puzzle, but 
is not supported by the Ss who had low 
evaluations of their performance on the 
puzzle. 

It is clear, then, that two of the three parts 
of the first hypothesis are supported by rat- 
ings given to VMC and M1 abilities only 
where the Ss felt that, or stated that, they 
had performed well on the puzzle. 




Among the Ss who made low evaluations 
of their performances (Part B of Table 2) 
there are no significant differences in the 
evaluations of VMC and M1 abilities when the 
means of the four experimental conditions are 
compared. Perhaps these Ss evaluated their 
performances on the puzzle, and their VMC 
and M1 abilities, so low that the pressure on
them, stemming from the E, to lower their
evaluations of themselves could cause no
further drop in self-evaluations regardless of the
strength of these pressures. Where the Ss had
made high evaluations of their performances 
on the puzzle, the height of these evaluations 
apparently provided “room” for the social 
pressures effectively to lower evaluations of 
closely related abilities. 

Hypothesis 2 states that where a person
gives a low evaluation of his performance
because his failure has been observed by an
expert, the evaluation of closely related abilities
will also be low, while the evaluations of
less closely related abilities will not be as low.
The results in Table 1 are inconclusive in
respect to this hypothesis, although they are
in the predicted direction.

In Part A of Table 2, as we have seen, the 
abilities which are most closely related to 
performance on the puzzle (VMC and M1) 
were evaluated significantly lower where the 
E was an expert and observed the Ss’ failures 
and where the Ss had placed a high evaluation 
upon their performances on the puzzle. It is 
evident that abilities M2, M3, and the “complete
person” were not, however, rated lower
where E was an expert and observed the Ss’ 
failures. These results support Hypothesis 2. 
In Part B of Table 2, Ss also evaluated their 
M2 abilities lower, for some unexplainable 
reason, when their failures had been observed 
by a nonexpert rather than an expert. 

It is also relevant to note that participants
who gave low evaluations to their performances
on the puzzle, compared to those who
gave high evaluations to their performances,
attributed significantly lower evaluations to
their VMC and M1 abilities when the responses
of Ss from all conditions were combined
(Col. e in Table 2). There were no
significant differences, however, in the ratings
given to M2, M3, or the “complete person”
(all conditions combined) when the responses
of persons with high evaluations of their performance
were compared with those who had
low evaluations of their performances. These
results appear again to indicate that low evaluations
of one’s original performance affect
evaluations of the most closely related abilities
more than they affect evaluations of those
abilities which are less closely related.

DISCUSSION

The results suggest a modification of the 
assumptions and hypotheses, but a modifica- 
tion that is consistent with the rest of the 
theorizing. In the statement of the theory at 
the outset of this paper, it had been assumed 
that individuals would tend to evaluate them- 
selves as highly as possible within the limits 
of reality, but that pressures toward more ob- 
jective evaluations from other persons would 
cause the Ss to depress their evaluations of 
their performances and abilities. The data 
suggest, however, that some persons who fail 
on a task make such low evaluations of their 
performances on that task, and of their abili- 
ties related to their performances, that social 
pressures could not lead to any further de- 
pression in their self-evaluations. Thus, it was 
found that Ss who made low self-evaluations 
of their performances evaluated their closely 
related abilities uniformly low, irrespective of 
whether they were being observed by the E 
and whether he was an expert. 

On the other hand, the high evaluations 
that some Ss made of their performances can 
be taken as a sign that they were motivated 
to make high evaluations of themselves, more 
than were those who made low evaluations of 
their performances. For the highly motivated 
Ss, variations in the degree of publicness of 
the Ss’ failures and in the expertness of the E 
had the predicted consequences on their eval- 
uations of abilities. When they were observed 
by an expert, they conformed to his presum- 
ably stronger influences and to his evident 
low evaluations, and thus evaluated their 
abilities as poorly as did the Ss with low 
evaluations of their performances. In line with 
the original hypotheses, those Ss with high 
evaluations of their performances rated lower 
abilities that were more closely related to the 
performances on the puzzle when observed by 
an expert than when not observed by him, or 
when observed by a nonexpert. Also, in keep- 
ing with the hypotheses, their evaluations of 
abilities were not different when they were
observed by a nonexpert than when not ob- 
served by him. Where the assumption that a 
person will try to keep his self-evaluation as 
high as possible is shown by the data to be 
present and operating, then, the experimental 
hypotheses were supported in regard to the 
Ss’ evaluations of their relevant abilities. 
Where this assumption was not fulfilled, the 
hypotheses were not supported. Instead of 
assuming, as was done in the opening para- 
graphs of this paper, that there is a tendency 
in all persons at all times to evaluate them- 
selves as highly as possible, we must assume 
that this tendency is present in some persons. 
Where it is present, the hypotheses proposed 
for this study are best fulfilled. 

This modification of the assumptions and 
hypotheses raises the question as to why some 
Ss attributed high evaluations to their per- 
formance on the puzzle, while others did not. 
In particular, the question is raised in refer- 
ence to the Ss in the public-expert condition 
in Table 2. No conclusive answer to this ques- 
tion is available, although it is likely that 
some of the Ss felt more need to look good in 
the eyes of the E even when they made their 
ratings privately. It is possible that the high 
self-ratings in the public expert condition may 
have been due to resistance to the stronger 
pressures exerted by the E in that condition. 
A complete answer probably requires some 
analysis of the motivations of the Ss when 
rating themselves. 

These results point to the need for a theo- 
retical and experimental refinement of the 
meaning of “expertness” for interpersonal 
relationships. It is not clear from the present 
experiment whether it is the prestige of the 
expert, the power of the expert, or the objec- 
tivity he assumes, which generates the ob- 
tained differences in the evaluations of the Ss 
in the separate conditions. A crucial test 
would be to obtain self-evaluations from Ss 
who were observed by a person whose prestige 
was varied independently of variations in his 
apparent ability to be competent in judging 
the Ss’ performances. 

The results support the second hypothesis 
of the study, namely, that the differences 
among the conditions are limited to evaluations
of abilities most closely related to the
abilities required for performance of the task.
Only one significant difference was found
among the conditions in the evaluations of
other areas of the self, in all probability a
result without psychological significance.

SUMMARY

In this study we have examined how varia- 
tions in the strength of social pressures, acting 
upon a person in the direction of lowering his 
self-evaluation, affect the evaluations the individual
assigns to the quality of his performance,
and to his more or less closely relevant
abilities, after he has failed on a task. Social 
pressures were assumed to be stronger on the 
person when his performance was viewed by 
an expert than when it was not observed, or 
when it was observed by a nonexpert. Social 
pressures were assumed to restrain a person 
from assigning a high evaluation to the qual- 
ity of his behavior or abilities. 

The subjects were individually and pri- 
vately each asked to solve an impossible 
puzzle which was described as a test of visual-motor
coordination. In half of the cases their
level of performance was observed by the 
experimenter; in the other half it was not ob- 
served. Within each of these conditions the 
experimenter introduced himself to half the 
subjects as an expert and to the other half as 
a nonexpert. After the subjects had failed, 
they rated their performances and abilities. 
The abilities had varying degrees of similarity 
to those under test in the puzzle. 

It was found that: 

1. Ss who made high evaluations of their 
performances on the puzzle, evaluated abili- 
ties that were involved in their performances 
lower when they had been observed by an 
expert than when they had not been observed 
by an expert, or when they had been observed 
by a nonexpert. 

2. Ss who made low evaluations of their 
performances did not evaluate these abilities 
differently under different experimental con- 
ditions. 

3. No consistent consequences from the
publicness or privacy of failure, nor from the
expertness of the experimenter, were found
when the results from the entire population
of subjects were examined in respect to the
evaluations of the quality of their performances
and their abilities.

4. The differences in the effects of the ex- 
perimental conditions were limited to those 
abilities which were most closely related to 
the abilities required by the task; evaluations 
of other parts of the self were less affected. 

It was concluded that strong social pres- 
sures acting on a person toward lower evalua- 
tions cause more lowering in his evaluation of 
his abilities when the person is strongly moti- 
vated to keep his evaluations high, than when 
he has little motivation to keep these evalua- 
tions high. 


DISTANCE AS A DETERMINANT OF INFLUENCE IN A TWO-PERSON
SERIAL INTERACTION SITUATION 


SEYMOUR FISHER AND ARDIE LUBIN¹
Walter Reed Army Institute of Research 


THIS paper is concerned with two major
aspects of a two-person social influence
situation. The first deals with the
systematic exploration of the effects of 
“distance” upon measures of influence, where 
distance refers to the size of the discrepancy 
between the subject’s (S’s) pre-influence judg- 
ment and the subsequent judgment originating 
from an influence source (6). The concept of 
distance corresponds closely to “opinion de- 
viance” as used by Festinger (2), Schachter 
(20), Goldberg (8), and others; in Asch’s work 
(e.g., 1), he speaks of “size of discrepancy”;
French (7) employs the same phrase in his 
model. These writers, along with Newcomb 
(18), have all suggested in one form or another 
that distance is an important variable in de- 
termining how much S’s judgment can be 
altered in a social interaction. The precise rela- 
tionship, however, seems to be in doubt. Do 
large distances produce more “influence” than 
smaller distances? One difficulty in attempting 
to answer such a question lies in the ambiguity
of the term “influence.” As noted in a previous
paper (6), influence can be measured in at
least two ways: (a) how much S moved from
his original judgment, and (b) how much S
conformed to the source’s judgment (i.e., S’s
movement relative to the distance). So long as
distance is held constant, measures of movement
and conformity are in a one-to-one relationship;
hence, either can be used satisfactorily.
When Ss are exposed to different
distances, however, the two measures are no 
longer directly comparable and yield different 
results. An S who moves 100 units away from 
his pre-influence judgment under a distance 
pressure of 200 is behaving quite differently 
from another S who moves 100 units under a 
distance pressure of 1000; while both Ss have 
moved equal amounts, the former S has con- 
formed .50 (i.e., has moved half-way toward 
the influence source’s judgment) as contrasted 
with the latter S who has conformed only .10. 

¹ Grateful acknowledgment is made to the invaluable
technical assistance of Esther Adams, Nicholas Covatta,
Peter Kyriazis, Major E. E. Admire, and Hilda Wexler.

One of the purposes of this study was to in- 
vestigate parametrically the way in which 
movement and conformity are related to the
independent variable of distance. 
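
The distinction between the two measures can be made concrete with a minimal sketch. The function names and the starting judgment of 500 are hypothetical and serve only to reproduce the arithmetic of the example above; treating conformity as undefined at zero distance is likewise an assumption of this sketch rather than a rule stated in the text.

```python
def movement(initial, later):
    """Change from the pre-influence judgment to a later judgment."""
    return later - initial


def conformity(initial, later, distance):
    """Movement expressed as a proportion of the distance to the source."""
    if distance == 0:
        # Assumption of this sketch: no proportion can be formed at zero distance.
        raise ValueError("conformity is undefined at zero distance")
    return movement(initial, later) / distance


# Two hypothetical Ss who both move 100 units from an arbitrary judgment of 500.
near = conformity(initial=500, later=600, distance=200)
far = conformity(initial=500, later=600, distance=1000)
print(movement(500, 600), near, far)
```

Both hypothetical Ss show the same movement, yet their conformity differs by a factor of five, which is why the two measures cannot be interchanged once distance varies.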

The second problem under investigation con- 
cerns behavior in a serial interaction situation, 
where S is exposed to a number of influence 
attempts on successive, discrete stimuli, each 
common to the same broad dimension. Whereas 
most experimental studies seem to focus on the 
effects of one or more influence trials treated as 
a single influence attempt (measuring the 
difference between pre- and post-influence
judgments), it is more usual in real life that
Ss are exposed to a set of influence attempts
from one source, sometimes on the same issue,
sometimes on different issues. In the present
study, the concept of “serial interaction” was
experimentally simplified by studying Ss’ reactions
to two consecutive stimuli. Three
questions were being asked here: (a) What are
the effects of repeated influence attempts on
the same stimulus? (b) What carry-over effects
does a series of influence attempts on one 
stimulus have upon the perception of a second, 
similar stimulus? (c) What effects does a series 
of influence attempts on one stimulus have 
upon subsequent influence attempts on a 
second, similar stimulus? 


METHOD 


The data have been obtained from two separate
studies, the second essentially a replication of the first.
The Ss were army enlisted men from Fort Meade,
Maryland. Both studies required Ss to judge the number
of paratroopers shown in briefly exposed photographs.
In the original experiment, 96 Ss were used in
a 3 × 2 × 2 factorial design. The independent variables
consisted of three different distances (40, 320, and 5100),
two experimenters (Es), and two sets of instructions
(one emphasizing accuracy, the other emphasizing
agreement with partner). A pilot study showed that a
significant experimenter-distance interaction would result
unless instructions were rigidly standardized; in
the study, instructions, Es, and interactions were
analyzed, and none was significant. The second experiment
used a total of 192 Ss, divided into six distance groups
(0, 40, 160, 320, 1300, 5100), 32 Ss per group; two
different Es ran the study, with standardized instructions
emphasizing accuracy.
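
The layout of the two designs can be checked with a short sketch. The experimenter labels are hypothetical, and the equal division of Ss among the cells of the first experiment is an assumption; the text states only the total of 96 Ss and the three factors.

```python
from itertools import product

# First experiment: 3 x 2 x 2 factorial design with 96 Ss.
distances_exp1 = [40, 320, 5100]
experimenters = ["E1", "E2"]                # hypothetical labels
instructions = ["accuracy", "agreement"]

cells = list(product(distances_exp1, experimenters, instructions))
n_cells = len(cells)
ss_per_cell = 96 // n_cells                 # assumes equal cell sizes
ss_per_distance_exp1 = 96 // len(distances_exp1)

# Replication: six distance groups with 32 Ss per group.
distances_exp2 = [0, 40, 160, 320, 1300, 5100]
total_exp2 = 32 * len(distances_exp2)

print(n_cells, ss_per_cell, ss_per_distance_exp1, total_exp2)
```

Under the equal-cell assumption each distance in the first experiment is represented by 32 Ss, the same group size as in the replication.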

Two Ss at a time were brought into the laboratory
and seated side by side in front of E, who read the
preliminary instructions to them. In brief, these instructions
said that E was interested in seeing how accurately
people could judge the number of troops shown in
photos for short periods of time; further, whether
people could be more accurate when they were working
together and able to talk things over than when they
were by themselves and could not discuss their judgments.
Each S was given a pad of colored paper to
write down his judgments, the color of the pad serving
as identification of S.


“Warm-up” Period


E began by exposing a photo for 10 seconds. After
the photo was removed, the two Ss wrote their independent
estimates on their pads. The estimates were
announced to the Ss, and (if Ss disagreed) they were
told to “talk it over between yourselves and try to be
as accurate as possible” (accuracy instructions). After
a few minutes of discussion, each wrote down his
second estimate, after which E announced the true
number, and Ss were permitted to examine the photo
and verify E’s statement. This warm-up period was
extended over five photos and served a number of
important functions: (a) Ss became accustomed to
estimating the number of paratroopers, and also became
familiar with the procedure of making more than
one estimate to the same photo. (b) The credibility of
the subsequent experimental phase was enhanced. Each
time S wrote down an estimate during the warm-up,
the papers were taken by E to an assistant a few feet
away (and partially out of sight) who recorded and
audibly stamped the estimates by machine. Thus, in
the experimental phase which followed, Ss accepted the
routine of having E take their estimates to the assistant
before they were exchanged (permitting the assistant
to intercept the original estimates and exchange them
for bogus messages).

The Ss were then told that for the remaining time
they were to work alone, with no talking between them;
all they could do was exchange their estimates. They
were placed in separate, adjacent cubicles where they
could not see each other. Each S retained the same
colored pad he had used in the “warm-up.”


Experimental Period 


In the account of the procedure which follows, the 
major measures which were analyzed are indicated in 
parentheses immediately following the operations to 
which they refer.

E exposed Card A, a photo containing 165 paratroopers,
for a 10-second interval, and Ss wrote down
their initial estimates (A_1). E picked up the estimates
to be stamped and recorded, during which time the
bogus estimates were written by the assistant on appropriately
colored paper. Hence, when the estimates
were “exchanged,” each S was led to believe that he
was receiving his partner’s estimate, whereas he actually
received an estimate which was a fixed distance greater
than his initial estimate (except, of course, that Ss in
the zero distance group received bogus estimates
identical with their A_1). Ss then made their second
estimate (A_2), the estimates were again “exchanged,”
and a total of five estimates was made by each S to
Photo A. The bogus estimates always remained the same
for any S throughout the five interchanges, regardless
of whether or not S changed his estimate at any time.
After the fifth estimate, Photo A was exposed again for
10 seconds, and Ss were asked to make a “final” estimate
(A_f).² A_1 and A_f were the only estimates made
immediately after seeing Photo A, without an intervening
bogus estimate.
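
The construction of the bogus messages can be sketched as follows. The function name, its parameters, and the sample estimate of 150 are hypothetical; only the rule itself (initial estimate plus a fixed distance, unchanged across the five interchanges) is taken from the procedure described above.

```python
def bogus_estimates(initial_estimate, distance, n_exchanges=5):
    """Return the bogus 'partner' estimates shown to one S on Photo A.

    The bogus value is fixed at the initial estimate plus the assigned
    distance and does not follow S's later changes of judgment.
    """
    bogus = initial_estimate + distance
    return [bogus] * n_exchanges


# A hypothetical S in the 320-distance group who first estimates 150.
print(bogus_estimates(150, 320))

# In the zero-distance group the bogus estimate equals S's own first estimate.
print(bogus_estimates(150, 0))
```

Because the bogus value is tied to the first estimate only, an S who moves toward it on later estimates reduces the remaining discrepancy, whereas the nominal distance of his group stays constant.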

Card B, a photo containing approximately 300
paratroopers, was then shown for 10 seconds, and Ss
made their initial estimates (B_1) to this photo. The
messages were “exchanged,” but this time all Ss, regardless
of the distance received on Photo A, received a
bogus estimate 320 greater than B_1. The Ss made a second
estimate (B_2), after which Photo B was again exposed
for 10 seconds, and Ss made a “final” estimate (B_f). An
additional measure, taken only in the replication study,
consisted of giving S, at the end of the experiment,
Photos A and B for a period of three minutes during
which time he was asked to count the actual number of
paratroopers in the two photos (A_c and B_c, respectively).

² In the replication, half of the Ss were not given the
A_f exposure at this point; instead, it was delayed until
after S made his B_2 estimate (vide infra). For all of the
measures analyzed, this variation in treatment produced
no significant effects, hence the A_f estimates
have been pooled.

RESULTS

The First Stimulus (Card A)


Intratrial effects. In the analysis of these
data, two separate measures of influence were
utilized. Movement is the amount of change
from S’s initial estimate (A_1) to a subsequent
estimate for the same stimulus; thus M_a1 indicates
S’s change on Card A from A_1 to A_2; M_a4
indicates the change from A_1 to A_5 (last estimate).
Conformity is S’s change proportional to
the distance between him and his partner;
thus, for an S who was exposed to a distance of
40 on Card A, C_a1 was computed by dividing
M_a1 by 40; C_a4 was computed by dividing
M_a4 by 40.

Fig. 1 shows the frequency distributions of
M_a1 for the six distances used (note that the
abscissa is scaled geometrically for convenience).
At large distances, distributions
take on a marked positive skew. Similar skewed
distributions obtain for virtually all variables
measured after exposure to different distances.
Because of this skew, the data were analyzed
using the median and other nonparametric
statistics.
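
Why the median is preferred for positively skewed scores can be seen in a small sketch. The movement scores below are invented for illustration and are not data from either experiment.

```python
from statistics import mean, median

# Hypothetical movement scores for one large-distance group: most Ss move
# little, while a single S moves very far (positive skew).
movements = [0, 0, 10, 20, 40, 50, 100, 2500]

group_mean = mean(movements)
group_median = median(movements)
print(group_mean, group_median)
```

In this invented group the mean is pulled far above every score but one, while the median stays near the typical S.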

In Fig. 2, the relationships between influence
and distance are presented. Influence on
Photo A is measured by movement and conformity
after the first influence attempt in
Figs. 2a and 2b, after the last influence attempt
in Figs. 2c and 2d. The following facts may be
gleaned from the curves:

1. The results are quite stable, as evidenced 
by the similarity of the data obtained from the 
two experiments (all medians are based upon 
an n of 32). 

2. The median amount of movement increases
with distance, at least up to a certain
point, after which a decreasing trend is suggested.
One might expect, a priori, that for a
particular individual, movement will increase
with distance until a point of subjective
“incredulity” or “implausibility” is reached,
above which the influence source will be disregarded,
with movement abruptly dropping to
zero.

3. In contrast with movement, conformity 
is a decreasing nonlinear monotonic function of 
distance. 

4. With repeated influence attempts on the
same stimulus, the amount of influence (both
movement and conformity) is increased. This
is particularly true of movement scores, seen at
all distances; C_a1 and C_a4, however, are essentially
the same at the larger distances. Maximum
influence generally occurs on the first
influence attempt (cf. 8, 9).

Persistence of influence. It will be recalled
that after the five estimates on Photo A were
“exchanged,” S was again shown the same
photo for 10 seconds and asked to give his
“final” estimate (A_f). It was expected that, for
those Ss who had been influenced from A_1 to
A_5, an additional exposure to “reality” would
permit them to correct themselves and give
more accurate estimates. Fig. 3, however,
shows that this does not occur. Rather, there
is some tendency for the groups to manifest still
further “latent” influence effects on A_f. In both
experiments, the median A_f estimates are
always equal to or greater than the A_5 estimates.
Approximately 40 per cent of all Ss
moved from A_5 toward the bogus estimate (and
away from “reality”) on A_f, while another 10
per cent (whose A_5 estimates were all below
165) also moved toward the bogus estimate,
coming closer to the true number; 25 per cent
of the Ss did not change their A_5 judgments at
all, and the remaining 25 per cent moved away
from the bogus estimate back toward “reality.”
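
The four-way breakdown just described can be expressed as a classification rule. The sketch below is one possible reading of that breakdown; the function name, the category labels, and the sample estimates are hypothetical, and direction is judged by whether the final estimate lies closer to the bogus estimate or to the true number than the fifth estimate did.

```python
TRUE_COUNT_A = 165  # number of paratroopers in Photo A


def classify_change(a5, a_final, bogus, true_count=TRUE_COUNT_A):
    """Classify the change from the fifth estimate to the 'final' estimate."""
    if a_final == a5:
        return "no change"
    toward_bogus = abs(bogus - a_final) < abs(bogus - a5)
    toward_truth = abs(true_count - a_final) < abs(true_count - a5)
    if toward_bogus and toward_truth:
        return "toward bogus, toward reality"
    if toward_bogus:
        return "toward bogus, away from reality"
    return "away from bogus"


# Hypothetical Ss, one for each of the four categories.
print(classify_change(a5=200, a_final=220, bogus=500))
print(classify_change(a5=120, a_final=140, bogus=440))
print(classify_change(a5=200, a_final=200, bogus=500))
print(classify_change(a5=250, a_final=200, bogus=570))
```

The second category can arise only for an S whose fifth estimate lies below the true number, which agrees with the remark that all such Ss had estimates below 165.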

While it appears, then, that influence per- 
sists when S is re-exposed to the same am- 
biguous situation (without any direct influence 
attempt being made), it should not be inferred 
that these effects represent an enduring change. 
When S is permitted a relatively extensive
reality contact with the stimulus (i.e., giving
him three minutes to count at the conclusion
of the study), the A_c estimates do tend to drop
back toward the true number, regardless of
previous distance-experience: Ss seem capable
of shaking off the influence effects to a considerable
degree when the ambiguity of the
stimulus is removed, although the over-all
median A_c (184) still is somewhat higher than
the true number (165). This apparent overestimate
is difficult to interpret clearly, since an
adequate control group is lacking (Ss in the
zero-distance group were exposed to a distance
of 320 on Photo B prior to their A_c estimates,
and therefore do not provide an appropriate
comparison); before one could validly infer
that the elevation of the obtained A_c curve
indicates some persistence of influence, it
would be necessary to show that Ss in the
absence of any previous influence attempts are
able to estimate accurately (or tend to underestimate)
Photo A during a comparable three-minute
counting session.


The Second Stimulus (Card B) 


The intertrial phenomenon. In an earlier
study of continuous social influence interactions
(6), it was found that under certain
conditions of immediate self-committal (i.e.,
where S gives an explicit judgment just prior
to an influence attempt), a considerable
amount of influence may be reflected between
rather than within interactions (trials). When
S has been under continuous pressure, applied
in some consistent pattern, it is not uncommon
to observe S appearing to anticipate the influence
source on subsequent interactions.
Marked changes are seen occurring in S’s committal
responses themselves, prior to any overt
attempt to alter his judgment on any particular
stimulus.

It was expected that the intratrial pressures
exerted on S during the five influence attempts
on Photo A would be reflected by a systematic
increase in the initial estimate (committal response)
to Photo B (B_1). From Fig. 4 it is clear
that this indeed occurs. B_1 is significantly related
to the distance S was exposed to on
Photo A: the larger the distance on Photo A,
the larger the median B_1. Groups exposed to
large distances on a stimulus may be highly
resistant to intratrial influence on that particular
stimulus, but when exposed to a new
stimulus (prior to any direct influence attempt),
the previous pressures become patently
evident. A number of Ss in the 5100-distance
group showed little intratrial movement on
Photo A, yet gave initial estimates of 4500 or
higher to Photo B (containing 300 paratroopers).

Intratrial effects. What happens when Ss
who have been exposed to different distances
on one stimulus are subsequently exposed to a
uniform distance (320) on a second stimulus?
It has already been seen that their initial estimate
(B_1) of this second stimulus is significantly
related to their previous distance-experience
(intertrial effect). But after they have
committed themselves to Photo B, how easily
can their judgments be further altered? Will Ss
who were exposed to a distance of 5100 on
Card A behave differently from Ss who were
exposed to zero distance? Fig. 5 indicates that
there is no systematic relationship between C_b1,
the amount of conformity on Photo B, and
distance on Photo A. Despite the fact that the
5100-distance group had already demonstrated
intertrial influence on Photo B, this group
appears to change neither more nor less than
any other group, including the zero-distance
Ss who show essentially no intertrial effects.

The rank-order correlations of C_b1 with
several other variables (e.g., C_a1, A_1, A_5, B_1)
were computed within each distance-group, but
no significant relations were obtained. Under
the conditions of the present study, then, C_b1
cannot be demonstrated to be related to any of
the temporally previous variables measured,
including distance on Photo A.
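
A rank-order correlation of the kind referred to above can be sketched in a few lines. The text does not say which coefficient was used; the sketch assumes the Spearman form (a product-moment correlation computed on ranks, with tied scores given their average rank), and all function names and sample values are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Product-moment correlation; fails if either variable is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman(x, y):
    """Rank-order correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical conformity scores on Photo B and initial estimates on Photo A.
c_b1 = [0.10, 0.25, 0.00, 0.50, 0.25]
a_1 = [150, 140, 200, 120, 180]
print(spearman(c_b1, a_1))
```

Working on ranks rather than raw scores keeps the coefficient from being dominated by a few extreme estimates, which suits the skewed distributions described earlier.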

Persistence of influence. When Ss were given
a ten-second re-exposure to Photo B (B_f),
Fig. 6 reveals a marked tendency toward
maintenance of the previous influence effects,
similar to the results on Photo A. The B_f curve
closely follows the B_2 curve at all distances: Ss
do not show any strong indication of dropping
back to “reality” even at the extreme distances.
However, the “latent” influence effect on B_f
is not as clear as for A_f.

When given three minutes to count Photo B 
at the conclusion of the study, Ss’ estimates 
(B,) are now independent of the distance on 
Photo A. All groups have similar median B, 
judgments (approximately 500), but all 
medians are considerably above the true num- 
ber of paratroopers (300). As noted in the 
case of A,, interpretation is equivocal in the 
absence of proper controls, but there is certainly
the suggestion that the “counting” behavior
of these Ss may be modified by previously
applied pressures. Since it was virtually
impossible to make an accurate count of the 300 
paratroopers on Photo B (quite different from 
Photo A, which contained only about half the 
number), the B, measurement becomes in 
effect a continued influence situation—differing 
from B, (seen for only 10 seconds) only in the 
degree of stimulus ambiguity. A few Ss in the 
1300 and 5100 distance groups who had given
high B, and B, estimates were observed to go 
through the complete motions of “counting,” 
and then give B, judgments of 3500 or 4000! 

DISCUSSION

The Effects of Distance

One of the major questions in this research 
concerned the relationship between measures of 
social influence and distance. The results 
clearly indicate that while movement tends to 
increase (at least up to a certain point) with 
distance, conformity is a monotonic nonlinear 
decreasing function. It would seem, then, that 
distance is a significant determinant of social 
influence, and hence must be considered in any 
attempt to formulate theoretical models of in- 
fluence—particularly since a very recent paper 
by Hovland and Pritzker (11) reports similar 
relations using attitudinal stimuli low in ego- 
involvement. These results do not appear to 
support Goldberg’s earlier finding (8) that con- 
formity is independent of distance. The present 
results are also pertinent to the Osgood and 
Tannenbaum model (19), French’s theory (7), 
and Mausner’s (17) recent discussion. 
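
The two influence measures can be made concrete with a small sketch. It assumes that movement is the change from S's initial judgment counted in the direction of the partner's estimate, and that conformity is that movement divided by the distance; the exact scoring used in the study is not restated in this section, so the function names and the numbers below are hypothetical.

```python
def movement(initial, final, partner):
    """Change from the initial judgment, positive when toward the partner."""
    direction = 1 if partner >= initial else -1
    return (final - initial) * direction


def conformity(initial, final, partner):
    """Movement expressed as a proportion of the distance (assumed ratio)."""
    distance = abs(partner - initial)
    if distance == 0:
        return None  # the ratio is undefined for a zero-distance S
    return movement(initial, final, partner) / distance


# Hypothetical Ss: the second moves farther in absolute terms (850 vs. 150)
# but conforms less relative to the much larger distance (about 0.17 vs. 0.50).
small_distance = (movement(150, 300, 450), conformity(150, 300, 450))
large_distance = (movement(150, 1000, 5250), conformity(150, 1000, 5250))
```

Under these assumptions a group can show more movement and less conformity at the same time, which is the pattern described above.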


Intertrial Effects and Persistence of Influence 

A striking phenomenon consistently ob- 
served in recent studies of social influence in 
continuous interactions (6) concerns the large 
amounts of influence manifested between inter- 
actions (i.e., in this instance, stimuli A and B). 
In the present experiment, Ss on Photo A not 
only tended to remain influenced when the 
same stimulus was presented a second time 
(A,), but also showed a carry-over effect when 
a new stimulus (Photo B) was presented: Ss 
demonstrated gross over-estimation tendencies 
positively related to the distance-pressure re- 
ceived on the first stimulus. These intertrial 
effects, occurring prior to any overt, direct in- 
fluence attempts on the second stimulus, often 
seem to rest in “anticipatory” attitudes in S. 
Just as expectations play an important role in 
hypnotic behavior (3, 4), similar factors seem 
to be operative in nonhypnotic continuous in- 
fluence situations. To the extent that S can 
perceive a more or less consistent bias in the 
source’s judgments, his frame of reference (22) 
is altered, and he subsequently tends to manifest
indirect influence effects in giving his initial,
pre-influence (committal) judgments to
new stimuli. Some Ss may even be quite resistant
to intratrial pressures on the first
stimulus, and then show a large intertrial 
effect. (This phenomenon may be a generaliza- 
tion of the “sleeper effect,” noted in a number 
of attitude-influence studies [e.g., 10, 13]: the 
main difference seems to be that whereas the 
“sleeper effect” refers to increased influence 
observed in subsequent retesting on the same 
stimulus, the intertrial effect refers to increased 
influence seen during testing on a new but
related stimulus.) 

When the stimuli all belong to a quantitative 
dimension, as in the present study, it is proba- 
ble that some psychophysical “anchoring” 
effects from S’s previous judgments partly 
contribute to the intertrial change—i.e., if an 
S was actually influenced on Photo A, then he 
will overestimate Photo B merely to remain 
consistent within his newly established frame 
of reference; this sort of behavior need not be 
described as “anticipatory,” since it is similar 
to the type of judgment change produced by 
Tresselt and Volkman (24) in the absence of 
social interaction. However, as Sherif (19, 20) 
has pointed out, both anchoring and antici- 
patory judgments have in common a shift in 
S’s frame of reference. Thus, in some cases, S 
may be anticipating his partner, attempting to 
“beat him to the punch” (i.e., minimize the 
anticipated distance between himself and the 
influence source). In other cases, S may still be 
“anticipating” the experimental situation, 
which requires a new frame of reference based 
upon S’s previous experience in the situation. 


Subsequent Intratrial Influence 


The finding that conformity on Photo B is 
independent of the distance on Photo A raises a 
question. The present results seem to indicate 
that distance on the first interaction is a basic 
determinant only of (a) intratrial movement on 
that first interaction, and (b) intertrial movement
between the first and second interactions.
Can it be concluded, then, that in real
life continuous social influence situations, the
distance-exposure of an S on one interaction 
has no predictive power in regard to his intra- 
trial behavior in a subsequent interaction on a 
related issue? This is quite unlikely, when one 
considers the results of a variety of experi- 
ments on “prior reinforcement.” 

When Ss, following an initial interaction (or
series), have an opportunity to learn something
about the correctness of their judgments
and/or those of their partners (i.e., the influence
source), this learning then affects their
subsequent judgments differentially (12, 14, 15,
16, 17, 21). For example, if there is some sort
of feedback (knowledge of results, or “validation”)
so that S learns he was “right” and/or
partner “wrong” on the initial interaction,
then subsequent intratrial conformity will be
modified. Goldberg and Lubin (9) further
found that, when S is given some validation information,
the distance on the preliminary
interactions becomes a significant determinant
of subsequent intratrial conformity.

Yet the data from the present study show no
such effects: Ss interacting with a peer partner
at different distances on one interaction do not
show differential conformity on the next interaction.
These findings suggest that, unless S
can “validate” his own and/or the influence
source’s judgments in any initial interaction,
his intratrial conformity behavior on a second
interaction will be independent of the distance
he received on the first interaction. Thus, the
present study indicates that the variable of
distance plays a salient role not only on a particular
influence attempt, but in continuous
social influence processes. (It has also been suggested
[5] that distance may be a fruitful concept
in the analysis of related influence
processes such as psychotherapy.)

SUMMARY AND CONCLUSIONS 

Pairs of Ss were placed in a social influence 
situation requiring judgments of the number 
of paratroopers seen in briefly exposed photographs.
Two photos were used successively
(Cards A and B) after a preliminary warm-up
period. On Photo A, distance (the size of the
discrepancy between S’s judgment and his 
partner’s) was systematically varied by inter- 
cepting S’s written estimate and replacing it 
with a bogus estimate at a predetermined 
distance. After a number of judgments were 
obtained on Card A, the Ss were exposed to 
Card B. On this latter photo, all Ss were ex- 
posed to the same distance. The experiment 
was replicated with the following major results: 

1. At large distances, distributions of the 
amount of influence on Photo A take on 
marked positive skewness, the amount of skew 
increasing with distance.

2. The amount of influence on Photo A is
significantly related to distance: (a) Move- 
ment (the amount of change from initial 
judgment) increases with distance up to a 
certain point; there is a slight suggestion (sup- 
ported by rational consideration) that the 
curve begins to decrease at extremely large 
distances. (b) Conformity (movement relative 
to distance) decreases with distance in a mono- 
tonic fashion. 

3. The amount of influence obtained on 
Photo A can be increased by sheer repetition 
of the suggestion, in the absence of any “side 
arguments,” persuasive reasons, or elaborate 
rationales. 

4. When Ss are permitted a second brief 
exposure to the stimuli, there is no evidence 
that Ss “correct” their previous post-influence 
judgments toward “reality.” In fact, on Card 
A in particular, the data indicate that Ss tend 
to show increased “latent” influence on the 
re-exposure. However, when Ss are permitted 
to count the paratroopers at the end of the 
experiment, the judgments are no longer 
related to distance. Part of the earlier influence 
effects may carry over to this “counting” be- 
havior, depending upon the ambiguity of the 
stimuli. 

5. Considerable amounts of influence are 
manifested between interactions on successive 
stimuli (intertrial effect). Marked changes are 
seen in S’s initial response to the second
stimulus, before any overt attempt to alter 
his judgment on this particular photo. The 
intertrial effects are positively related to the 
distance on the first stimulus. A number of 
Ss who were resistant to intratrial influence 
at large distances on Photo A showed consider- 
able intertrial change. 

6. Intratrial influence on the second stimulus 
was found to be independent of the distance 
on the first stimulus. It was hypothesized that 
unless S can “validate” his own and/or his 
partner’s judgments in an initial interaction, 
his intratrial influence behavior on a second 
interaction will be unrelated to the distance 
on the first interaction. 

THE INFLUENCE OF LANGUAGE-ACQUISITION CONTEXTS
ON BILINGUALISM

THE bilingual person presumably must
learn two different verbal responses for
each referent, a requirement analogous to
learning different responses to the same stimu- 
lus in experimental studies of negative transfer. 
It is of psychological interest, therefore, that bi- 
linguals are so often able to communicate freely 
in either language without interference from the 
other. As yet, no adequate psychological theory 
has been offered to account for bilingualism. In 
his review of various limited explanations, 
Weinreich concludes that “a comprehensive 
psychological theory of bilingualism ought to 
account for both the effectively separated use 
of the two languages and for interference of the 
languages with one another” (9, p. 71). It is 
the purpose of the present investigation to con- 
tribute to such a theory by analyzing condi- 
tions that are presumed to affect both the 
“effectively separated use” of the bilingual’s 
two languages and the “interference of the 
languages with one another.” 

As a first step, it may be profitable to apply 
certain principles derived from paired-asso- 
ciate learning and transfer phenomena to the 
case of learning two languages.³ In the experi-
mental study of retroactive inhibition, the
learner must first link S₁ to R₁ and, in the later
interpolated phase, S₁ to R₂. When alternative
responses are associated with a common stim- 
ulus, interference between the two associations 
is the usual consequence. The bilingual is con- 
fronted with an analogous learning problem 
when, for example, the symbols “church” and 
“église” are both associated with the appropriate
environmental event.

One general means of minimizing interference
in learning situations is to increase the distinctiveness
of the two tasks to be learned.⁴ In
studies of retroactive inhibition, for example,
interference is effectively minimized by increasing
the degree of learning of original and
interpolated tasks or by decreasing the associative
similarity between the competing responses.
Both these variations enhance the
distinctiveness of the tasks. If distinctive secondary
cues, such as differently colored backgrounds,
are associated with the two tasks to
be learned, the functional separation of the
tasks should be increased.

³ In 1915, Epstein (2) considered bilingualism in
terms of such a psychological theory, but did not
experimentally test its applicability.

⁴ Essentially the same reasoning underlies the “direct
method” of language teaching. The direct method
theorists maintain that the acquired language should
be learned directly from association with environmental
events without the mediation of the native
language. When learning French as a second language,
one should never link “pomme” with “apple” but should
link “pomme” directly with the perceived environmental
event. In contrast to the “indirect” method
which utilizes translation as an aid in acquisition, the
direct method is purported to develop “pure” bilingualism,
the ability to move from one linguistic
and semantic world of experience to another with
comparatively little interference.

Following the logic of interference theory, we
can reason that response competition in bilingualism
is functionally related to the distinctiveness
of contexts in which the bilingual’s two
languages are acquired; the more “separated”
the contexts of acquisition, the less the bilingual
interference. Acquisition contexts are
considered to be separated when the association
between a symbol such as “church” and
the appropriate environmental event has consistently
taken place in a setting distinct from
that in which the association between “église”
and its corresponding environmental event
occurs. On the other hand, in “fused” acquisition
contexts, members of a linguistic community
can and do use both symbols for any
particular environmental event, thereby increasing
the interchangeability of the symbols.

Recently, Ervin and Osgood (4) attempted a
theoretical interpretation of bilingualism that
indicates the potential importance of language-acquisition
contexts. They suggest that bilinguals
could theoretically acquire “compound”
or “coordinate” language systems. The compound
system would be developed through experience
in fused contexts, as with vocabulary
training in school, or where the same family
members use two languages interchangeably to
refer to the same environmental events. The
coordinate system would be developed through
experience in different linguistic communities
where languages are rarely interchanged. They
suggest that the alternative symbols in his two
languages have a single meaning for the compound
bilingual whereas the coordinate bilingual,
presumably because of the distinctiveness
of his language-acquisition contexts, develops
separate meanings for each of his alternative
symbols.

Accepting this theoretical position, we hy- 
pothesized that experience in “separated” lan- 
guage-acquisition contexts enhances the func- 
tional separation of the bilingual’s two lan- 
guages while experience in “fused” contexts 
reduces the functional separation of the two 
language systems. We predicted that bilinguals 
with experience in separated contexts would 
show comparatively greater semantic differ- 
ences between symbols in one language and 
their translated equivalents in the other, more
associative independence of translated equiva- 
lents in their two languages, and less facility of 
switching from one language to the other. 


METHOD 


Classification of Subjects 

The Ss were college or university students 
with extensive experience in both English and 
French languages. Since all but four were either 
preparing for or actually in graduate schools, 
it can be assumed that the group was relatively 
homogeneous in intelligence. Each S was tested 
for extent of bilingualism (5, 6) and the 32 who 
met our criterion for bilingual “balance” will 
be discussed here. “Balance” implies a high 
level of second language competence, high 
enough, in fact, that no dominance of one 
language over the other is measurable with the 
technique employed. 

Our primary task was to classify Ss accord- 
ing to their language-acquisition contexts and, 
to this end, detailed information was obtained 
from each S concerning how, when, and where 
his languages were acquired. Acquisition con- 
texts were considered separated when one 
language was learned exclusively in the home 
and the other exclusively outside the home, 


W. E. LAMBERT, J. HAVELKA, AND C. CROSBY


when one parent consistently used one lan- 
guage and the other parent a different lan- 
guage, or when one language was acquired in a 
particular national or cultural setting distinct 
from that in which the second was acquired. 
Acquisition contexts were considered fused 
when both parents used the two languages 
indiscriminately, when both languages were 
used interchangeably inside and outside the
home, or when an individual acquired his 
second language in a school system stressing 
vocabulary drill and translation and where 
the first or native language was used as the 
medium of instruction. This scheme of classifi- 
cation is derived from that used by Ervin and 
Osgood (4) in their definitions of compound 
and coordinate systems. The actual classifica- 
tion of Ss as having separated or fused contexts 
for acquisition is essentially an arbitrary one 
in spite of the rather explicit criteria for place- 
ment. In five cases, for example, a problem 
arose in deciding whether to give credit for 
some experience with separated contexts when 
an S had early experience in a fused context, as 
when an English S acquired French in a school 
system but later used this language exclusively 
in France for some period of time. Our decision 
was to classify such an S as separated if he 
spent at least an entire year using the acquired 
language either exclusively or primarily. Using 
this scheme, 23 Ss were classified as “separated”
and 9 as “fused.”⁵

It was also possible to further subdivide the 
“separated” group according to the following 
line of reasoning. An S could be classified as 
“separated” if he had experience either in dis- 
tinctive acquisition contexts within a particu- 
lar national setting, as in Quebec, or in two 
geographically different cultures, as from living 
many years in France before coming to Amer- 
ica. We predicted that separated contexts 
based on “bicultural” experiences, as in the 
latter example, would particularly enhance the 
distinctiveness of the two contexts. In this 
case, the environmental events themselves may 
often not be common for the two sets of symbols.
For example, “église” could mean a gothic
cathedral while “church” could mean a tall
wooden building used on Sundays, and “ami”
and “friend” could take on quite different
meanings. A recent study by Ervin (3) demonstrates
how such environmental differences
affect the perceptual and personality functionings
of bilinguals. With these considerations
in mind, the “separated” group was subdivided
into 15 Ss with bicultural and 8 Ss with “unicultural”
experiences.

⁵ In view of our method of selecting Ss (see 6)
where an attempt was made to draw equal numbers
of both types, one may safely generalize that it is
comparatively easier for bilinguals with separated
acquisition contexts to attain bilingual balance.

Behavioral Indices of the Influence of Acquisition Contexts


As a measure of the differences in meanings 
of concepts in the bilingual’s two languages, we 
made use of Osgood’s “semantic differential” 
(7), a method of scaling stimulus-words on a 
standard set of meaning dimensions. For ex- 
ample, an S is presented with a stimulus-word, 
such as “house,” and is asked to rate it along a 
seven-point dimension, such as “fast-slow” or
“pleasant-unpleasant.” In the present case,
the stimulus-words were four common English
concepts (“house,” “drink,” “poor,” “me”)
and their French translations. They were pre- 
sented in a fixed order so that any word was 
maximally separated from its translated pair. 
Ten standard dimensions (see 8) were used and 
their order of presentation was continually 
changed to eliminate any patterning of ratings. 
The S might rate “house” at Position 3 on the 
“fast-slow” dimension, and later “maison” 
might be rated at Position 5 (more slow) on 
the same dimension, which had been trans- 
lated into French. Since both stimulus-words 
and the anchors of each dimension were trans- 
lated, we can only make statements about 
differences in meanings of both stimulus-words 
and dimensional attributes, or their interac- 
tion. The average degree of difference between 
the French and English semantic ratings for 
the four pairs of stimulus words was determined
for each S using Osgood’s D score (7);
the larger the D score, the greater the semantic 
differences between translated equivalents in 
French and English. We predicted that Ss 
with experience in separated language-acquisi- 
tion contexts would show comparatively 
greater semantic differences between translated 
equivalents in their two languages.
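
The D score can be sketched as follows, assuming the usual form of Osgood's measure: the square root of the summed squared differences between two rating profiles taken over the same dimensions. The ratings are hypothetical, and `mean_d` is an illustrative helper for averaging over the word pairs, not a procedure quoted from the study.

```python
import math


def d_score(ratings_a, ratings_b):
    """Euclidean distance between two semantic differential profiles."""
    if len(ratings_a) != len(ratings_b):
        raise ValueError("profiles must use the same dimensions")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(ratings_a, ratings_b)))


def mean_d(profile_pairs):
    """Average D over (English, French) profile pairs for one S."""
    return sum(d_score(e, f) for e, f in profile_pairs) / len(profile_pairs)


# Hypothetical ratings of "house" and "maison" on ten seven-point dimensions.
house = [3, 2, 5, 4, 6, 1, 4, 3, 5, 2]
maison = [5, 2, 4, 4, 6, 2, 4, 3, 5, 3]
d_house = d_score(house, maison)  # sqrt(4 + 1 + 1 + 1) = sqrt(7)
```

Identical profiles give D = 0, so larger values mean that a word and its translation are rated less alike.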

As a measure of the degree of associative in- 
dependence of translated equivalents, we made 
use of the retroactive inhibition design. If an S
learns list A, then list B, then re-learns list A,
one can determine the amount of associative
interference imposed by the interpolated material.
If the interpolation has no effect on the
previously memorized material, then the two
can be considered functionally independent.
Interference and facilitation from the inter- 
polated material may be thought of as indices 
of the functional dependence of the first- 
learned and interpolated tasks. 

Each S learned a list of 20 commonly occur- 
ring English words by the anticipation 
method. A criterion of 8 correct responses out 
of 20 was chosen so as to permit improvement 
following the interpolated task. This was fol- 
lowed by three presentations of a list of 20 
three-letter nonsense words. Then the English 
list was re-presented and the S’s retention was 
measured. Finally, Ss were asked to relearn 
the nonsense list, but these data will not be 
discussed here. This preliminary test gives a 
measure of the associative interference or 
facilitation imposed by a meaningless task, one 
assumed to be semantically independent of the 
original task. After a rest period, the same pro- 
cedure was repeated, but this time a second, 
similar list of 20 English words, learned to the 
same criterion, was followed by a list of their 
exact translations in French, presented for 
three trials. A score was assigned to each S
which indicated the degree of associative inter- 
ference or facilitation imposed by the interpo- 
lated French material in relation to interpo- 
lated meaningless material. The following 
illustrative case indicates some facilitation 
from both interpolated tasks but more from 
the interpolated French. The number of cor- 
rect responses for the criterion trials are placed 
in Column A, and the number of correct re- 
sponses for retention trials are given in Column 
A’, for the English-Nonsense-English and the 
English-French-English series: 


             A          Interpolated    A′         A − A′

Series I:    E₁ = 10    N (3 trials)    E₂ = 11    −1
Series II:   E₁ = 10    F (3 trials)    E₂ = 15    −5

                                        I − II = +4


Positive scores indicate that the interpolated 
French translations facilitate retention of 
English words more than do interpolated nonsense
words. Negative scores indicate that interpolated
French translations interfere with the
retention of English words. A zero score would
indicate that the interpolated French translations
offer no more facilitation than interpolated
nonsense words. Following the reasoning
outlined above, the closer the scores approach 
zero, the more associatively independent the 
translated equivalents. We predicted that Ss 
with experience in separated acquisition con- 
texts would show more associative indepen- 
dence between translated equivalents in their 
two languages than those with experience in 
fused contexts. 
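
The scoring described above can be written out as a short sketch that reproduces the illustrative case: the retention change (A − A′) for the English-Nonsense-English series minus the same change for the English-French-English series. The function names are hypothetical.

```python
def retention_change(criterion_correct, retention_correct):
    """A - A': negative when retention exceeds the criterion trial."""
    return criterion_correct - retention_correct


def independence_index(nonsense_series, french_series):
    """Series I change minus Series II change; each series is (A, A')."""
    return retention_change(*nonsense_series) - retention_change(*french_series)


# Illustrative case from the text: Series I gives 10 - 11 = -1,
# Series II gives 10 - 15 = -5, so the index is (-1) - (-5) = +4.
index = independence_index((10, 11), (10, 15))
```

An index of zero means the French list helped or hindered no more than the nonsense list did, which is the sense in which scores near zero indicate associative independence.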

Finally, we predicted that experience in 
fused as compared to separated contexts would 
express itself in a greater facility of switching 
from one language to the other. As a measure 
of such facility, we made use of a speed of trans- 
lation test described elsewhere (6). Each S, 
given 20 French and 20 English words to 
translate, was assigned a score for his average 
speed of translation from English to French 
and from French to English. Speed was measured
by a voice key in circuit with a chronoscope.

RESULTS AND DISCUSSION


As predicted, those bilinguals who acquired 
their languages in separated contexts showed 
a significantly greater difference in meanings of 
translated equivalents than did those who 
acquired their two languages in fused contexts 
(see Table 1). However, when a comparison 
was made between those who acquired their 
two languages in geographically distinct cul- 
tures (bicultural experience) and those who 
acquired both languages in separated contexts 
within one geographical region (unicultural 
experience), it became clear that bicultural ex- 
perience accounted for the over-all difference 
between fused and separated groups. In fact, 
Ss with unicultural experience are not essen- 
tially different from either of the other two 
groups with respect to similarity of meanings 
of translated equivalents. It appears, therefore, 
that the difference in meanings of translated 
equivalents is not a simple function of experi- 
ence in separated acquisition contexts. Expe- 
rience in bicultural acquisition contexts, where 
there is a greater likelihood that the actual 
referents of translated symbols are different, 
does appear to affect the semantic aspects of 
the bilingual’s two languages. 

TABLE 1
SEMANTIC DIFFERENTIAL COMPARISONS

Contexts          N    Mean English-French    Variance   t testsᵇ   p values
                       Profile Separations*
Fused (F)         9    3.19                   0.38
Separated (S)     23   3.93                   1.23
Bicultural (B)    15   4.22                   1.47
Unicultural (U)   8    3.39                   0.33

* Entries are mean D scores (7). The larger the mean, the
greater the semantic differences between languages.

ᵇ Where the variances were not homogeneous, p values were
determined according to a formula suggested by Edwards (1, p.
273 ff.). All t tests in this and the following tables are 2-tailed tests
of significance.

Bilinguals with experience in separated acquisition
contexts also showed significantly
more associative independence of translated
equivalents in their two languages than did
those with experience in fused contexts (see
Table 2). An interpolated list of French translations
had about the same influence for the
separated group as an interpolated list of
nonsense syllables. Ss in the fused group, on
the other hand, clearly benefited from the
interpolated French list.

TABLE 2
ASSOCIATIVE-INDEPENDENCE COMPARISONS

Contexts          N    Mean Independence   Variance   t tests     p values
                       Indices*
Fused (F)         9    +3.44               9.75       F-S:        <.02
Separated (S)     23   +0.13               11.23      F-B:        <.05
Bicultural (B)    15   +0.60               10.86      F-U:        <.02
Unicultural (U)   8    −0.75               12.28      B-U: <1     n.s.

* The closer the scores approach zero, the more between-language
associative independence indicated. A constant of 8 was
added to each score to eliminate negative values before variances
were computed.

These findings are consistent with those 
summarized above for semantic differences. 
Since translated equivalents were semantically 
more similar for the fused Ss, the interpolated 
translations in the learning problem were more 
like an extension of the original English list. 
It is logical therefore that these Ss were 
helped more by the interpolated French than 
were the separated Ss. Since translated equivalents
have comparatively different meanings
for the separated Ss, interpolated translations 
should have shown less associative facilitation 
and possibly more interference with the origi- 
nal English list, as was the case. 

This interpretation has its limitations, however,
since the data in Table 2 indicate that
there is no difference in degree of associative 
independence between bilinguals with bicul- 
tural and those with unicultural experiences. 
In fact, both these subgroup means are sig- 
nificantly different from that of the fused 
group. This finding suggests that experience in 
separated contexts of either a bicultural or uni- 
cultural nature enhances the associative inde- 
pendence of translated equivalents in the bilin- 
gual’s two languages, as compared to experi- 
ence in fused acquisition contexts. 

Table 3 makes it clear that there were no dif- 
ferences between groups in facility of switching 
from one language to the other, a finding con- 
trary to predictions. There is no immediately 
obvious interpretation for this lack of differ- 
ence between groups, other than the possible 
lack of sensitivity of our measuring device. 
Accepting the finding as reliable, one may 
speculate that Ss in both groups have had 
equal experience in translating but possibly 
different types of experience. The fused Ss, by 
definition, have learned their two languages in 
contexts that encourage translation. On the 
other hand, a bilingual who has contacts with 
two separate linguistic communities may often 
be called on to recount his experiences with
members of both communities. In a sense, he 
becomes the liaison between two linguistic 
communities and he may learn to use concepts 
which have meaningful equivalents for mem- 
bers of both communities. 

When requested to translate, Ss in the two 
groups demonstrate similar facility to switch. 
In view of the other findings, however, the 
fused and separated Ss appear to make dif- 
ferent use of the translation process. The fused 
Ss made use of the interpolated translations in 
the learning task and accordingly showed im- 
provement in their retention scores. It is pos- 
sible, although no attention was paid to this 
feature in the collection of data, that the sep- 
arated Ss were less aware that actual transla- 
tions were being interpolated than were the 
fused Ss. Stated differently, the fused Ss may 
have been more set to translate than were the 
separated Ss. 

TABLE 3
SWITCHING FACILITY COMPARISONS

Contexts          N    Mean Translation   Variance   t tests     p values
                       Speed*
Fused (F)ᵇ        7    125.08             112.43     F-S: <1     n.s.
Separated (S)     23   125.69             288.65     F-B: <1     n.s.
Bicultural (B)    15   124.59             303.42     F-U: <1     n.s.
Unicultural (U)   8    127.76             254.62     B-U: <1     n.s.

* Entries, in hundredths of a second, are group means. The
larger the mean, the slower the group is in translating into French
and into English.

ᵇ Two Ss were not available for this test.

In conclusion, we have been able to demonstrate,
within the limitations of the small
sample of Ss studied, that language-acquisition
context does influence the functional separation
of the bilingual’s two languages in certain
respects. The theory of coordinate and compound
language systems has been given empirical
support and the defining characteristics
of these systems have been extended. The coordinate
bilinguals, in contrast to the compound
bilinguals, appear to have more functionally
independent language systems. If their
coordinateness has been developed through
experience in culturally distinctive contexts,
they will have comparatively different meanings
for concepts translated into their two
languages. Coordinate and compound bilinguals,
however, appear to have equal facility in
switching from one language to the other.


SUMMARY 


Bilinguals were classified as having learned 
their two languages in either separated or fused 
contexts. It was hypothesized that experience 
in separated as compared with fused language- 
acquisition contexts comparatively enhances 
the effectively separated use of the bilingual’s 
two languages. 

It was found that experience in separated 
contexts comparatively increases the associa- 
tive independence of translated equivalents in 
the bilingual’s two languages. If the bilingual 
has learned his two languages in culturally distinctive
contexts, the semantic difference
between translated equivalents is comparatively
increased. There was no difference found
in facility to switch from one language to the 
other that can be attributed to contextual in- 
fluences. 

The findings were related to the theory of
compound and coordinate bilingual language
systems.


Much has been written about the conscious ideals and unconscious
motives of both Germans and Americans (3, 4, 5, 7, 8, 9, 11, 13, 14, 16),
but little of it has been based on factual evidence systematically
gathered for comparative purposes. The present study does not pretend to
be complete or definitive because it is based on a very limited sample of
upper middle-class male adolescents, but it does attempt (a) to see the
value systems of the United States and Germany as functioning wholes and
to compare them without prejudice as to which is better, and (b) to make
the comparisons on the basis of actual measurements taken from individual
subjects.

While it is fair to state that it was the intention to make the
comparison without prejudice, it is also true that no study can be made
without preconceptions. So it would be well to state these in advance as
explicitly as possible: the general assumption which guided the study was
that every society has to strike some kind of a balance between freedom
and order, between the wishes of the individual and the obligations of
social living, between egocentricism and sociocentricism. Such an
assumption has particular value for this study for two reasons. First,
two of the best-developed measures of human motivation (n Achievement and
n Affiliation) fall easily into one orientation or the other: n
Achievement into the egocentric and n Affiliation into the sociocentric
orientation. And, second, previous research as well as popular stereotype
has tended to picture Americans as “individualistic” (with overtones of
“other-direction” and conformity, to be sure) and the Germans as
“sociocentric” (again with a highly individualized sense of honor). So,
while this study was designed to be an exploratory inductive enterprise,
it was guided by the very general notion that measures should be used
that would contrast the two cultures on the obligations to self and to
society, although, as the ensuing section will show, many measures were
also taken that did not fit precisely into this scheme.

1 This study was completed when all the authors were connected with
Wesleyan University and is based on Sturr: A comparison of the values of
German and American secondary school leaders and nonleaders. Unpublished
A.B. thesis, Honors College, Wesleyan University, Middletown,
Connecticut, 1955.

2 Now at the University of Rochester.

3 Now at the University of Mainz.


METHOD 
Measures 


The materials used consisted of a picture-story test 
of n Affiliation and n Achievement and a question- 
naire. The method of deriving n Achievement and n 
Affiliation scores from brief written stories to pictures 
has been fully described elsewhere (12, 17). In this 
study, the pictures used to elicit stories related to 
achievement were: a male adolescent at a desk, a 
foreman and worker in a factory, a “father” talking to 
his “son” with a farm background, and a man working 
at a desk in his office. The pictures used for measuring 
n Affiliation were: seven men seated around a table, a 
mother-son scene (TAT 6BM), a man standing alone 
looking out through a door, and a figure underneath a 
lamp post at night (TAT 20). The instructions for 
eliciting the stories were standard (12) and so far as 
possible identical in the two countries. The coding of 
the German stories was done in German using a transla- 
tion of the English manuals for scoring n Achievement 
and n Affiliation. 

The questionnaire consisted of 38 items of the F-scale type (1) to
which the subject (S) was to express the degree of his agreement or
disagreement on a scale from +3 (for complete agreement) to −3 (for
complete disagreement). After the questionnaires were returned, the
scale was changed to run from +1 for −3 to 7 for +3, as is common
practice, all the results being reported in the latter terms. Three basic orientations 
influenced the choice of items for inclusion in the 
questionnaire—the self vs. society dimension already 
mentioned, previous speculative or research findings on 
differences between Germans and Americans, and the 
framework set up by F. Kluckhohn (10) for the com- 
parative study of values. The variety of sentiments 
covered is illustrated by the fact that at least two items 
touched on each of the following ideas: achieved vs. 
ascribed status, loyalty to family and country, emo- 
tional control or pride, self-development, kindness and 
love, community service, affiliative values (friendship), 
achievement values, other-directedness, man’s rela- 
tionship to nature (e.g., as “‘over” or “under” nature), 
the importance of fate and destiny, orderliness and 
rational planning, and will power. In some cases, a 
single item was believed to tap more than one of these 
ideas so that the categories were in no sense conceived 
as mutually exclusive. In fact, since the categories will 
play very little part in our analysis, the chief point in 
mentioning them is to demonstrate how wide a net was 
cast and to make it easier for subsequent investigators 
to find obvious areas for comparison that were overlooked. 
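
The rescoring step described above (ratings from −3 to +3 shifted onto a 1 to 7 scale) can be illustrated with a minimal sketch. The function names are hypothetical, and the handling of a rating of 0 or of an omitted answer is an assumption, since the text does not say how such cases were treated.

```python
def rescore(raw: int) -> int:
    """Shift a raw agreement rating (-3 .. +3) onto the 1 .. 7 scale.

    -3 (complete disagreement) becomes 1 and +3 (complete agreement)
    becomes 7. Accepting 0 as a valid rating is an assumption; the
    source does not state whether a neutral answer was allowed.
    """
    if raw not in range(-3, 4):
        raise ValueError(f"rating out of range: {raw}")
    return raw + 4


def item_mean(raw_ratings: list[int]) -> float:
    """Mean rescored value of one item across a group of Ss."""
    scores = [rescore(r) for r in raw_ratings]
    return sum(scores) / len(scores)
```

On the rescored scale a mean of 4 marks the midpoint between agreement and disagreement, so an item mean below 4 indicates that the group tended to disagree with the item.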

There were also three open-ended questions at the end of the schedule.
The first asked the student to list his extracurricular activities
(clubs, hobbies, sports, etc.), which were then simply counted and coded
as involving either group or individual activities. The former included
all clubs and team sports (soccer, football, basketball), and the latter
individual hobbies (e.g., stamp-collecting) and individual sports
(hiking, playing chess or tennis). The second open-ended question asked
“What are the three things you would most like to teach your children?”
and the answers were coded as being either egocentric (e.g., to do well
in school, to appreciate music, to enjoy life) or sociocentric (e.g., to
be kind, respect others, to be loyal to one’s friends). The final
open-ended question asked them to list the “six students in your class
who you think are most likely to be leading members of their community
30 years from now.” The purpose of this item was to break the students
roughly into two groups, one consisting of “leaders” and the other of
“nonleaders,” at least from the point of view of their peers. In all
classes tested, both in Germany and the United States, it turned out
that roughly half the Ss received three or more votes and so this
cutting point was used to divide the Ss into “leaders” and “nonleaders,”
although it should be clear that the categorization is very crude, and
that the term “leadership” is being used in a very restricted sense in
this context.
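
The peer-nomination split just described amounts to counting votes and applying a cutting point of three. The following is only a sketch of that bookkeeping; the function name, the data layout, and the choice to count a name at most once per ballot are assumptions, not details given in the text.

```python
from collections import Counter


def classify_leaders(ballots, class_members, cutoff=3):
    """Label each S as "leader" or "nonleader" from peer nominations.

    ballots       -- one list of nominated names per voting student
    class_members -- every S in the class, including those never named
    cutoff        -- votes needed to count as a "leader" (three in the text)
    """
    # Count each name at most once per ballot (an assumption).
    votes = Counter(name for ballot in ballots for name in set(ballot))
    return {
        s: "leader" if votes[s] >= cutoff else "nonleader"
        for s in class_members
    }


# Hypothetical four-person class, for illustration only.
example = classify_leaders(
    ballots=[["A", "B"], ["A", "B"], ["A", "C"], ["B"]],
    class_members=["A", "B", "C", "D"],
)
```

Because the cutting point was chosen after seeing that it split each class roughly in half, the labels are relative to the class and not an absolute standard of leadership.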


Subjects


The Ss used were males nearly all between the ages of 16 and 19 (mode
17, median nearly 18 in both countries). Furthermore, all of them
belonged to what might be called the educational élite who were
preparing to go on for higher education either in a college or
university. The nature of the sample was dictated to some extent by the
way the German educational system is organized. Beginning at the age of
10, German students who are going on to the university tend to be
segregated from students with lesser educational or occupational goals
(who go to Volksschule or Mittelschule), so that a sample of German
students tested in the “higher” schools around the age of 18 tends to be
a more highly selected group than one would find in the ordinary United
States high school senior class. Since it was difficult to get a
cross-section of German students at this age, it was decided to limit
the comparison to the relatively élite group to be found in German
higher schools. Fifty-one students were obtained from a Neusprachliches
Gymnasium and 36 from an Altsprachliches Gymnasium, in Kaiserslautern,
Germany. To match this sample, Ss were drawn from college preparatory
classes in a high school in Utica, New York (N = 35) and from a New
England private school (N = 39). The private school students were tested
to provide a group more nearly comparable to those in the
Altsprachliches Gymnasium which emphasized a classical curriculum, and
also to make certain that the sample of United States Ss was as highly
selected as the German one. They also make the American sample
unrepresentative because private schools exist in large numbers only in
the northeastern section of the United States, but the purpose of the
study is not primarily to get representativeness of each culture but to
contrast the value orientations of comparable young male élites within
the two cultures. The testing was done in December, 1954 in the United
States and in March, 1955 in Germany.


TABLE 1

DISTRIBUTION OF OCCUPATIONAL LEVELS OF FATHERS
OF BOYS IN UNITED STATES AND GERMAN SAMPLES

                                                    U.S.          Germany
Occupational Level                                  (N = 70)*     (N = 85)*
6. Major professional and business executive        32%           24%
5. Minor professional and medium business           26            39
   6 and 5 combined                                 58%           63%
4. Clerical and sales                               24            22
3. Skilled mechanical and public service workers    17            [illegible]
2 and 1. Semi and unskilled workers                 [illegible]   [illegible]
   4 to 1 combined                                  [illegible]   [illegible]

* Information not available on all fathers.

The attempt to obtain boys in the two countries from the same
backgrounds was quite successful, as Table 1 shows. As expected, the
students came predominantly from the upper middle class and therefore
constitute an “élite” group in each country, but the proportion of them
from different backgrounds was roughly the same in both countries. The
difference between Levels 6 and 5 is not significant and, in any case,
is more apparent than real because there were 12 students in the German
sample whose fathers were teachers in Höhere Schule. According to the
coding system, high school teachers in the American sample were
classified as minor professionals and placed in Level 5, but the fact of
the matter is that such teachers in Germany have a considerably higher
status than high school teachers in this country and probably belong at
the very least somewhere between occupational Levels 6 and 5. If, to
take account of this, half of them were classified in Level 6 or Levels
6 and 5 are combined, as in Table 1, then the distributions of
backgrounds of the boys in the two samples become very nearly identical.
Furthermore, the percentages of Protestants and Catholics were around
65% and 28% in both groups. In short, any nationality differences found
cannot readily be attributed to differences in the socioeconomic level
of the two groups of Ss, or to age differences (since they were matched
for age), or to differences in educational plans (since they were all
preparing for college), or to differences in religious background.


RESULTS 


The mean score for each of the 38 items on the questionnaire was
computed for both the German and American Ss.⁴ About one half of the
items revealed differences between the means for the two nationality
groups that were significant at or beyond the .05 level. It would be
easy and tempting to work out post hoc interpretations of these
differences, but probably unwise to do so. Even though great care was
taken to make the German translations as precise as possible, it is
still highly likely that with emotionally toned sentiments such as those
used in the questionnaire, the immediate impact of the statement would
not be exactly the same on German and American audiences. So at least
some of the differences might be attributed to differences in the way
the sentiments were expressed. Furthermore, several of the items that on
the surface looked as if they should be tapping the same sentiments
yielded contradictory results. For example, the Germans agreed much more
with Item [number illegible], “There is no satisfaction in any good
without a companion” (p < .01), but agreed less (p < .05) than the
Americans with Item 2, “No sane, normal, decent person would ever think
of hurting a close friend.” Both items deal on the surface with the
importance of friendship, but the Germans subscribe more to the former
and the Americans to the latter, which incidentally is one of the items
drawn (without the final words “or relative”) from the F scale on which
Germans generally score higher (4). With such variable results, how can
one draw general conclusions as to which country values friendship more
highly?

4 These average scores together with the German and American version of
each question have been deposited with the American Documentation
Institute. Order Document No. 5423 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress, Washington 25,
D. C., remitting in advance $1.25 for microfilm or $1.25 for
photocopies. Make checks payable to Chief, Photoduplication Service,
Library of Congress.

For these reasons it was decided to perform a factor analysis on the
questionnaire to pick out those items which belonged together in each
country. Such a procedure ensures that the items to be combined are
chosen beforehand on the empirical basis of covariation, rather than
afterwards, in order to explain the particular grouping of results on
individual items. Factor analysis also tends to minimize the importance
of the translation factor by yielding groups of items that must have
been reacted to in somewhat similar ways in the two countries if
matching factor patterns can be obtained. A correlation matrix was set
up for each country⁵ consisting of 36 of the 38 objective items, the two
measures of motivation (n Achievement and n Affiliation), plus the
number of egocentric virtues mentioned as being desirable to teach
children and the total number of activities the S listed. Inspection of
the matrices showed that practically none of the a priori clusters hung
together. That is, items that had been written to tap the same
sentiments were not intercorrelated consistently in both countries. So a
factor analysis became all the more urgent. It was clear from inspection
that certain items showed what appeared to be only random relations with
other items. It was therefore decided to include in the factor analysis
only those items whose average intercorrelation with the other items
departed from zero at the .05 level or better, or those items whose
correlations with other items showed a rate of dispersion exceeding the
.05 level of probability. However, in order to make matching of factor
patterns in the two countries easier, any item was included in both
analyses if it met either criterion of significance in either country.
Rotation to simple structure was performed blindly on the matrix of 19
items chosen by this method and yielded results readily interpretable in
terms of two matching factors plus a set of items that shifted from one
factor to the other in the two countries, as shown in Table 2. Factor A
includes five items in each country that saturate ±.30 or more, most of
which appear to deal with striving, rationality, and planning. The two
most highly saturated items are from the so-called v Achievement scale
(6), which reflects for the most part the extent to which a person
consciously subscribes to achievement sentiments for himself. For
simplicity, this factor has been named “rational striving” and perhaps
even the one deviant item dealing with the respect due an older man can
be thought of as belonging under this heading in the sense that the
older man (father figure?) is a kind of symbol or identification figure
for people with these sentiments. Factor B contains four items that
saturate fairly highly in both countries.
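
The item-screening step described above rests on two quantities for each item: the average of its correlations with the other items, and the spread of those correlations. The sketch below computes only these two descriptive quantities. The significance tests the authors applied at the .05 level are not specified in the text and are therefore left out; all function names and the data layout are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def pearson(x, y):
    """Product-moment correlation of two equal-length score lists.

    Assumes both lists vary (a constant item would divide by zero).
    """
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ssx = sum((a - mx) ** 2 for a in x)
    ssy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(ssx * ssy)


def intercorrelation_summary(responses):
    """For each item, summarize its correlations with every other item.

    responses -- dict mapping item label to the list of scores given by
                 the Ss of one country (same S order for every item).
    Returns a dict: item -> (mean correlation, standard deviation).
    """
    summary = {}
    for item, scores in responses.items():
        rs = [pearson(scores, other)
              for label, other in responses.items() if label != item]
        summary[item] = (mean(rs), stdev(rs))
    return summary


# Hypothetical scores from four Ss on three items, for illustration only.
demo = intercorrelation_summary({
    "item_a": [1, 2, 3, 4],
    "item_b": [2, 4, 6, 8],
    "item_c": [4, 3, 2, 1],
})
```

An item would be retained under the rule in the text if either quantity departed reliably from what random relations would produce, in either country. The summary would therefore be computed once per country and the two lists of retained items pooled.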


5 We are especially indebted to Ralph Haber who ran off the
correlations on a high-speed computer.


TABLE 2
MATCHING FACTOR PATTERNS OF ITEMS FROM A QUESTIONNAIRE ADMINISTERED TO
UNITED STATES AND GERMAN MALES OF HIGH SCHOOL AGE

(Saturations less than ±.29 omitted)

Columns: Factor A (U.S., Germany); Factor B (U.S., Germany).
[The saturation values and parts of several items are illegible in this
copy. The legible item wording is listed below; "..." marks lost text.]

Factor A: Rational striving
  34. I work like a slave ...
  ... never be asked to ... unless he is told why ... is asked to do it
  ... world conditions ...
  ... lets tomorrow take ...

Factor B
  7. It is better to go without something than to ask a favor
  18. There is hardly anything lower than a person who does not feel a
      great love, gratitude, and respect for his parents

Items with unique patterns
  40. Egocentric values mentioned more often than sociocentric values as
      desirable for children
  22. It would irritate me very much to have a watch or clock which was
      off by several minutes every day or ...
  24. My political opinion is easily swayed by editorials I read
  When I see a man working at an ordinary unskilled job, I wonder why he
      doesn’t try to better ... in life
  If you get bad news it is better to hide what you feel and behave as
      if you didn’t care
  A man who gives his life to helping others in trouble is better than a
      man who spends his life trying to work out schemes so that there
      will be less trouble in the world


It seems to reflect a somewhat passive or 
resigned attitude toward life with a touch of 
cynicism, an attitude perhaps best summed up 
in a shrug of the shoulders and the French 
phrase “‘c’est la vie.” The interesting thing is 
that four items switch from being saturated on 
Factor A in Germany to being saturated on 
Factor B in the United States. All of them seem 
to involve a kind of formal concern for the “proprieties,” for the
abstract code of politeness and obligation to others. As would be
expected from this, the score on these four items (the C items) is
correlated +.40 (p < .05) with the Factor A score in Germany but only
.06 in the United States. In contrast, the score on the C items is
correlated +.28 (p < .05) with the score on Factor B in the United
States and −.13 in Germany. In other words, explicit concern for 
“decency” goes with resignation and cynicism 
in the United States and with rational striving 
in Germany. The difference is nicely high- 
lighted by the saturations for Item 40, which 
shows that frequent mention of egocentric 
virtues is negatively related to Factor A in 
Germany and positively related in the United 
States; or to put it more understandably, 
frequent mention of sociocentric virtues (e.g., 
loyalty, kindness, honesty) is positively as- 
sociated with rational striving (Factor A) in 
Germany but negatively in the United States. 

Some of the remaining items with unique factor loadings in the two
countries also add to the clear-cut total picture that emerges. For
example, perhaps having an inaccurate watch is irritating to Germans
high in Factor A because it keeps them from being orderly in fulfilling
their obligations, whereas in the United States where orderliness is not
a part of the Factor A pattern, being irritable about a watch is not
part of the self-picture of the “rational striver.” In short, in Germany
Factor A is definitely sociocentric: conscientious striving is included
in a network of idealistic sentiments relating to decent social
behavior. In the United States, rational striving is part of no such
framework. On the contrary, sentiments of idealistic concern for
“decency” are related to a kind of “un-American” passivity, resignation,
and cynicism. In passing, it is perhaps worth noting that two of the C
items are from the F scale (one slightly modified) and the other two
have an F-scale flavor, a fact which suggests that F-scale-like
sentiments may be part of a deviant, rather cynical attitude toward life
in the United States, whereas in Germany they are organized into an
idealistic pattern dealing with order, efficiency, and striving.

More light on the German-American contrast is shed by Table 3 which
shows the mean scores for each nationality group on each of the three
sets of items derived from the factor analysis, the two measures of
motivation, and the two kinds of extracurricular activities in which the
students said they were engaged. The Germans score significantly higher
on Factor A (“rational striving”) and the C items (“concern for
decency”) as expected, and as the first line in Table 3 shows, the
difference cannot be attributed to a general tendency of the Germans to
acquiesce more to items of this sort. That is, since the Germans tend to
agree more with all three sets of items included in Table 3, it was
necessary to check their acquiescence to all items in the questionnaire
to make sure that the differences on these sets of items were not due
simply to a stronger tendency for Germans to go along with or acquiesce
in emotional sentiments of all sorts (2). No such tendency appeared. The
German students further list themselves as being engaged in more outside
activities of an individual nature, that is, in activities that do not
require group organization or participation.

The Americans, on the other hand, score higher on n Achievement and much
higher in the number of extracurricular group activities that they list.
Since the difference in n Achievement score is of considerable
importance in subsequent interpretation, and since it might have arisen
from coding differences or the “pulling power” of the pictures used, it
was carefully checked by comparing subcategory frequencies for each of
the pictures used. The German students gave less achievement imagery for
each of the pictures used, the smallest difference being for the picture
of the student and the largest for the “father-son” picture, which may
have been somewhat inappropriate for the Germans since the cattle in the
background rather suggested to them an American scene. Nevertheless,
even among the Americans this picture evoked the least amount of
achievement imagery, and even with it omitted, the difference between
the two groups is still significant. So it would be hard to attribute
the results to differences in the appropriateness of picture cues for
the two groups.


TABLE 3
MEAN SCORES FOR VALUES, MOTIVES, AND OUTSIDE ACTIVITIES OF U. S. AND
GERMAN MALES AGED ...

[The cell values are illegible in this copy. Rows, as far as they can be
recovered from the table and the text:]
  Acquiescence (36 items)*                       M, SD
  Factor A (5 items): “Rational striving”        M, SD
  Factor B                                       M
  C items (4 items): “Concern for decency”       M, SD
  n Achievement                                  M, SD
  n Affiliation                                  M, SD
  Group activities                               M
  Individual activities

* Two items dropped, one because it was too complex to be understood,
the other because nearly everyone in both countries agreed fully with it.

However, it was found that the German scorers had used the category,
“task imagery,”⁶ significantly more often than the American scorers had
because, being less familiar with the achievement motive scoring system,
they tended, when in any doubt at all, to use this category, whereas the
American scorers had been instructed to use it only as a last resort.
Consequently, the German mean score might be lower than the American
because more real instances of achievement imagery had been classified
as doubtful in Germany, which would also mean according to the scoring
system that further subcategories could not be scored in such instances.
For this reason, a random sample of 46 of the German protocols was
rescored in German in the United States by someone with considerable
experience in scoring n Achievement in English. The correlation between
these scores and the original scores was .70, which is lower than
desirable, probably because of the difference in the use of the “task
imagery” category. Still, the revised mean score for the German sample
was actually lower (2.30) than the one given in Table 3, indicating that
most of the instances of “task imagery” in the German sample should
really have been classified as unrelated to achievement at all. So there
seems little doubt that so far as this comparison is concerned, the
German students are significantly lower in n Achievement score.⁷

The American boys, then, think more frequently in terms of doing well, a
finding that is also supported by the greater frequency with which they
mention egocentric rather than sociocentric virtues as desirable to
teach to children. “Egocentric” is, of course, not meant in an
evaluative sense, but refers to the development of individual capacities
(to be intelligent, to appreciate music, to enjoy life, etc.).
Unfortunately, the German coders did not force all virtues listed either
into an egocentric or sociocentric category as the American coders did,
but a comparison of the differences in frequency of mention of the two
kinds of virtues in the two countries is still legitimate. That is, in
both countries egocentric virtues were mentioned more often, but the
difference in favor of them over sociocentric virtues was significantly
larger (t = 2.52, p < .02) in the United States than in Germany.

6 In the scoring system for n Achievement (12) “task imagery” has a
value (= 0) between “unrelated” (= −1) and “achievement-related” (= +1)
imagery.

7 Ordinarily such protocols would be scored without knowledge of which
group they belonged to but since they were written in different
languages, this procedure was not possible in the present case without
translating one set into another language. In our view, translation was
more objectionable methodologically than the procedure followed here,
especially since coding for n Achievement by highly trained scorers is
ordinarily done with a re-score reliability of .90-.95, which does not
leave much room for unconscious bias to operate.

If the Americans are so concerned with 
achievement, what keeps them in line? What 
prevents unbridled individualism? How are 
they to fulfill their obligations to society? The 
answer apparently lies in their much greater 
participation in group activities. They are not 
brought up on sentiments of obligations to 
others as the Germans are, but from kinder- 
garten on they regularly participate in many 
more extracurricular functions of a group 
nature. In fact, by far the most impressive 
result in Table 3 is the low number of group 
activities listed by the Germans (about 1, on 
the average) as compared with the Americans 
(about 5, on the average). In these activities 
the American student must learn a good deal 
more about getting along with other people 
and doing things cooperatively, if these clubs 
are to function at all. If there is any doubt on this point, there is an
interesting pair of items correlated positively both in Germany (+.20)
and the United States (+.27) on which the two groups of students differ
considerably. They are the following:

                                                     Mean scores
                                                     U.S.     Germany
  #9.  The negative opinion of others often keeps
       me from seeing a movie or play I had
       planned to attend                             3.30     2.37
  #24. My political opinion is easily swayed by
       editorials I read                             [illegible]

Both groups tend to disagree with these items, but the Germans much more
than the Americans (t = 3.77, p < .001). What is clearly suggested
here is the American’s greater sensitivity 
to public opinion, to being guided or directed 
by others as Riesman has so persuasively put 
it (15). The American’s obligation to develop 
himself, to achieve, is checked and channelled 
by the opinion of others in the groups in which 
he participates so that individualism does not 

run rampant. 

The German is also “individualistic” but in a somewhat different way. He
subscribes more than the American to sentiments about rational striving
(Factor A) and engages in more individualistic outside activities.
Furthermore, he agrees more with two other items on the questionnaire
which shed light on the nature of this attitude toward the self even
though, as evidence, answers to the items do not rank high because they
are uncorrelated in either country. The items are as follows:

                                                     Mean scores
                                                     U.S.     Germany
  #7.  It is better to go without something than
       to ask a favor                                [?].28   4.88
  #15. If you get bad news, it is better to hide
       what you feel and behave as if you didn’t
       care                                          4.92 [only one value legible; column uncertain]

In both cases the Germans agree much more with the item (p < .001) than
the Americans do, suggesting that they have more pride in self-control
and perhaps think of the self as having the will power to exercise
restraint in such cases. How is this kind of self to fulfill its
obligations to society? Clearly not by “learning from acting
cooperatively in groups” as in America but from a strong sense of social
obligation, of one’s duty to others and of the necessity to put such an
obligation above selfish interests.

Evidence was also sought for linkages be- 
tween the two motives measured and the three 
groups of value items or the extracurricular 
activity patterns, but none was found. In cor- 
relating the two motive scores with the three 
types of value scores (Table 3) and with the 
total number of individual or group activities 
in each of the two countries (16 different 
correlations) only one correlation reached the 
.05 level of significance and it might well have 
been obtained by chance. The correlation was 
between n Affiliation and Factor A (“rational 
striving”) in Germany (r = -.22). It gains added
significance only because the same correla-
tion in the United States was +.21 and the dif- 
ference between the two correlations is highly 
significant. If anything, this result would 
appear to mean that spontaneous concern for 
others (n Affiliation) may make it harder to 
subscribe to an ideological code of individual- 
istic achievement in Germany, whereas n 
Affiliation may be positively related to rational 
striving in the United States because achievement
values there are more connected to working
with others. Nevertheless, the overwhelming
impression of this analysis is that motive
scores (as obtained from fantasy) and value 
scores (as obtained from questionnaires) are 
not correlated and are essentially independent 
variables. Consequently, great care should be 
taken to specify the method of measurement 
when one speaks about such things as an 
“interest in achievement,” which may refer 
either to the n Achievement score and its 
correlates or something like Factor A with its 
correlates. 
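
The two statistical points made in this paragraph can be illustrated with a short calculation. The sketch below is not the authors' own computation: it assumes that each correlation rests on the full national sample reported in the Summary (87 German and 74 United States students), that the usual Fisher r-to-z test is an acceptable way to compare the two correlations, and that the 16 tests can be treated as independent (an approximation, since they share subjects). The function names are illustrative only.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """Two-sided z test for the difference between two independent correlations."""
    z1, z2 = math.atanh(r1), math.atanh(r2)          # Fisher r-to-z transform
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # standard error of z1 - z2
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))           # two-sided normal tail area
    return z, p


def chance_of_any_hit(k, alpha=0.05):
    """Probability that at least one of k independent null tests reaches alpha."""
    return 1.0 - (1.0 - alpha) ** k


# Assumed sample sizes: 74 (United States) and 87 (Germany), from the Summary.
z, p = fisher_z_difference(0.21, 74, -0.22, 87)
print(f"difference between correlations: z = {z:.2f}, two-sided p = {p:.4f}")
print(f"chance of at least one 'significant' r in 16 tests: {chance_of_any_hit(16):.2f}")
```

Under these assumptions the difference between the two correlations is significant well beyond the .01 level, while the chance of at least one of 16 independent correlations reaching the .05 level is better than even. Both results fit the cautious reading given in the text.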

Finally, an analysis was made to see if the 
students nominated by their peers as future 
outstanding citizens differed in any systematic 
ways from those not so nominated. For the 
most part they did not. As far as item or 
factor differences on the questionnaire are 
concerned, the two groups of “leaders” did 
not differ significantly from the “nonleader” 
groups within either country or from each other 
between countries. Only one finding of possible 
future interest emerged: in both countries 
nominated “leaders” had higher average n 
Affiliation than “nonleaders” (p <.05), a fact 
which suggests that among peer groups, even 
across nationality differences, it is the spon- 
taneous interest of individuals in others 
(n Affiliation) that tends to make them “stand 
out” as potential leaders in the perception of
the group members. Further interpretation of 
these results is not justified in view of the large 
number of insignificant findings and the pos- 
sibility that such a “nomination” procedure 
may not actually select “leaders” in a very 
stable or meaningful way. 

DISCUSSION 

The findings of the present study may be 
most simply discussed in terms of the follow- 
ing table which has been organized in terms of 
the double obligation of “the individual in 
society” to fulfill some of his own needs and 
also to satisfy some of society’s needs: 


                Obligation to self                   Obligation to society

United States   High n Achievement, concern          High participation in group
                for egocentric “self-development”    activities, greater sensitivity
                virtues                              to the opinions of others

Germany         High v Achievement (concern for      Greater concern over obligations
                rational striving), greater          to an idealistic code of decency
                participation in individualistic     (governing interpersonal
                activities, more emphasis on         behavior)
                power of self to exercise
                restraint

Presumably every culture develops more or
less patterned solutions to these two problems 
if it exists for long, and the individual member
learns these norms and reflects them in his 
behavior and attitudes. The point to notice is 
that according to this analytic scheme, both 
obligations must be at least in part fulfilled by 
a majority of its members if the society is to 
function at all. If society should demand too 
much of its members, it should eventually 
collapse through apathy or rebellion because its 
members can no longer satisfy their needs. On 
the other hand, if society is neglected through 
exclusive concern with individual satisfaction, 
it should eventually collapse through anarchy. 
No one is interested in the welfare of others and 
society degenerates into the “war of all against 
all.” Between these two theoretical extremes, 
there should be all sorts of ways of structuring
the two sets of obligations which produce 
working social systems of greater or lesser 
efficiency. The data of the present study sug- 
gest that the scheme is of some value in pre- 
senting the somewhat different ways in which 
morality is structured in the United States and 
Germany. In the United States a high spon- 
taneous interest in achievement is counter- 
balanced by much experience in group ac- 
tivities in which the individual learns to 
channel his achievement needs according to 
the opinions of others. In Germany a strong 
sense of self as a striving, controlling entity is 
offset by an equally strong sense of obligation 
to a code of decency. Interestingly enough, the
American “value formula” appears to be
largely unconscious or informally understood,
as compared to the German one, at any rate.
That is, the American boys have higher n
Achievement, which is uncorrelated with a
conscious value placed on striving, and they
participate in many group activities without,
it is fairly safe to say, having a very explicitly
formulated moral justification for such behavior.
Yet we are arguing that functionally
group activities have just such a moral basis
in checking excessive individualism. For the
Germans, on the other hand, the whole matter
appears to be much more consciously worked
out. The moral code is more formal and
ideological, as other students of German
character have observed (14). Individuals are
taught and “know” they have a great obligation
to the group and strong egos, which
are, however, capable of making sacrifices of 
personal feelings to meet this obligation. In the 
present study, the German value pattern 
appears most clearly in the sentiments ex- 
pressed on the questionnaire, while the United 
States value pattern appears most clearly in 
the projective material and the kind of activ- 
ity the students engage in. 

The present study is obviously of very 
limited scope. It is based on small samples of 
German and American boys of predominantly 
upper middle-class status. But the general- 
izations drawn from it suggest a much wider 
application to the cultures involved. What 
evidence is there from previous studies that 
our conclusions do or do not apply more 
generally? Has anything been added or sub- 
tracted to what is already known about 
German and American “national character”?
In the main, our results are supported by what 
others have found previously, with one possible 
change in emphasis to be noted below. Prob- 
ably the most sensitive clinical or anthropo- 
logical comparison of the two countries was 
made by the social psychologist, Kurt Lewin 
(11), shortly after he immigrated to the
United States from Germany. His observations
agree at almost every point with our conclusions.
He found “probably no other people
as interested in individual accomplishment...
as the Americans” (11, p. 277). On the other 
hand, he calls attention to the flexibility of 
Americans, to the ease with which they meet 
strangers, or appear to change their “per- 
sonality” (opinions and attitudes) to adjust to 
new situations. In short, he recognized what 
Riesman has called the other-directedness (15) 
of many Americans which clearly appears in 
the data of this study. 

So far as the Germans are concerned, he
described them as carrying more of their
“specific individual characteristics to every
situation,” as having a greater sense of their
own identity as separate individuals than
Americans do. This kind of “individualism”
we have also met in the reports of our German
students with the additional note that the
self-concept carries with it connotations of
personal striving and self-control. Lewin sees
the German’s sense of obligation to society
largely in terms of his tendency to obey orders
from higher authority without question. He
says, for instance, “In Germany it seems to be 
the natural right of the adult to rule and the 
duty of the child to obey” (11, p. 270). In this
respect his opinion is supported by a large 
number of research workers (1, 3, 4, 8, 13, 16). 
Their point of view is perhaps best summarized 
by McGranahan as follows: “The German is 
noted for his unquestioning obedience to 
authority, his failure to exercise individual 
responsibility and act on the basis of independent
moral judgment” (13, p. 248).

It is at this point that our study differs
most from previous ones, and even here the 
difference is not so much one of fact, but of 
interpretation. There seems to be no reason to 
doubt the evidence that Germans are more 
obedient. The problem arises when attempts 
are made to explain why they are so obedient, 
even when such obedience causes great personal
suffering or pain to others. Explanations
have frequently been given in terms of such
motivating factors as “identification with the
aggressor,” “authoritarian-submission,” “sado-masochistic
trends,” and the like. All such
explanations suffer from doing little more than
naming the behavior (i.e., Germans obey because
they like to submit) and from interpreting
it in ways which, however acceptable they may 
be to outsiders (particularly if they dislike 
Germans), do not make sense to Germans as 
the rationalizations that at least consciously 
lead them to obey. The present study does not 
pretend to get at unconscious factors behind 
German behavior, but it does shed some light 
on why they behave as they do from their point 
of view. 

To point up the issue as sharply as possible, 
compare the average agreement with the 
following item by our United States and 
German boys: 

Mean scores

U.S.    Germany

3.50    5.12      #35. A child should never be asked to do anything unless he is told why he is asked to do it.


German boys want explanations significantly 
more (p <.01) than American boys do. On the 
surface at least, this finding would appear to 
contradict the notion that Germans are noted 
for “unquestioning obedience,” but one must
distinguish between behavior and the attitudes
behind it. Suppose we assume that other investigators
are correct, that German boys do
obey more promptly than American boys, then 
our problem is to understand why they demand 
explanations for orders. Here lies the key to 
the difference between our interpretation and 
some others. We believe they obey not because 
they in some sense like obedience and there- 
fore “obey blindly” but because they have 
some reason, preferably some idealistic reason, 
for obeying. Without the reason, without the 
ideal, obedience is meaningless. Hence, we 
believe, the German emphasis in our data on
obligation to an idealistic code of decency. 

To make the contrasting interpretations 
even clearer, consider the following item from 
McGranahan’s study (13) of German and 
American high school students: 


Do you think a boy is justified in running away from 
home if his father is cruel and brutal? 


Three out of four of the American boys said 
they felt the boy was justified in running away, 
whereas the German boys split about 50-50. 
McGranahan is inclined to interpret this 
finding as supporting the hypothesis that 
Germans are more submissive and more used 
to obeying family authority without question 
no matter how cruel it may be. But our pre- 
vious analysis suggests that the item does not 
properly distinguish between what Germans 
do and why they do it. They may in fact be 
more submissive to cruel fathers, but not, if 
our interpretation is correct, because it is good 
per se to submit but because it is a way of 
discharging one’s duty to an ideology of 
proper behavior to others (including parents as 
a very special case). In other words, the 
German boy may understand the question 
about running away from a cruel father in 
terms of whether he is willing to endure per- 
sonal discomfort to follow the code governing 
children’s obligations to their parents, which 
is seen by him to be clearly correct in the 
abstract so that not living up to it for personal 
reasons seems like a form of selfishness. In 
fact, the overriding importance of this code of 
social obligation is frequently stressed in 
German literature by pointing out that it not 
only may cause suffering to one’s self (thus 
overriding selfishness) but to others as well. 
McGranahan and Wayne in an interpretation 
very similar to ours note that in German plays
“we find...various murders of beloved
persons consciously carried out by sympa- 
thetically portrayed characters” (14, p. 454). 
The characters are sympathetic because they 
must sacrifice even their “nearest and dearest” 
in the interests of a “higher,” more idealistic
goal. The same theme appears in a popular
German fairy tale, Der treue Johannes, in
which a father must cut off the heads of his 
twin sons to fulfill an obligation to a servant 
whom he has wronged. Apparently, Germans 
learn in this way that the ultimate test of the 
sincerity of one’s commitment to a code of 
idealistic obligation is the extreme to which 
one will go in the sacrifice of personal interests
or even the interests of particular others. It is 
in this latter sense that the German concern 
with decency to others is “ideological” rather 
than “personal,” as McGranahan and Wayne
(14) point out.

Outsiders may well wonder how a people so 
concerned ideologically with what we have 
called “decency” were also capable of so much 
cruelty to others under the Nazis. Many have 
struggled with this problem at deep psycho- 
dynamic levels, but a clue to its solution may 
lie in the two characteristics of the code of 
obligation just mentioned. First, it seems to 
be ideological rather than personal. It applies 
to “others” in the abstract, not to “particular
others,” so that it doesn’t seem so
inconsistent to sacrifice particular others in the 
interest of an abstract code of right conduct 
designed for the welfare of the whole group. It 
is perhaps worth noting in passing that the 
negative correlation in Germany between n 
Affiliation and Factor A (sentiments of rational 
striving and decency) may reflect the con- 
flict between the ‘ideological’ and the per- 
sonal. It may be harder to subscribe to an 
ideology of personal striving and decency if one 
has much spontaneous interest in particular 
others (n Affiliation) in a country where particularistic
interests are often thought of as
conflicting with one’s larger obligations.
Second, cruelty may actually appear as the 
validating sign of the strength of one’s com- 
mitment to the code, as in Der treue Johannes. 
One must show that he can “take” even doing
horrible things to prove⁸ his commitment to a
higher end. Since on this point McGranahan
and Wayne’s study came to the same general
conclusion based on pre-Nazi data as ours
does on post-Nazi data, it appears that what
the Nazis did was to play up a tendency to
idealistic sacrifice of personal interests, even of
affiliative ones, always present in German
character.

⁸ One may even wonder, somewhat paradoxically, if
the Germans are forced, functionally speaking, to
emphasize self-sacrifice so much just because they have
such a strong sense of self. Perhaps the problem is less
acute for the American whose sense of personal identity
is less well-developed in the sense that he talks more
freely to others (Items #7, #15) and is more easily
guided by others (Items #9, #24).

In the main then, our conclusions are supported
by other students of German and
American character. We differ chiefly in seeing
German obedience as a response that at
least from the internal point of view is dictated
not by a liking for obedience but by a
strong belief in the importance of sticking to an
abstract, ideological code of right conduct in
society. As confirmation of this alternative
interpretation, McGranahan and Wayne (14)
found that obedience is not always good in
German plays: but the one time rebellion is
legitimate is when it is in the interest of some
higher good of society as a whole. In short, it
is not obedience per se which is good. It is
seen as instrumental to an ideology of right
social behavior that may on occasion demand
rebellion.


SUMMARY


A questionnaire covering various value
sentiments and a picture-story test for measuring
n Achievement and n Affiliation were
administered to 87 German and 74 United
States male students preparing for college who
had been carefully matched for age (around 18
years) and socioeconomic background (largely
upper middle class). A factor analysis of the
questionnaire results yielded two factors that
could be matched in the two countries and a
third set of items that shifted from one factor
to the other across countries. The “switch-over”
items which dealt largely with decency
and obligation to others were associated with
Factor A (“rational striving”) in Germany
and were more strongly accepted by the Germans
than by the Americans, as were also the
Factor A items. In the United States, the
switch-over items were associated with Factor
B (“resigned cynicism”) suggesting that
acceptance of statements about strong obligations
to others tends to be associated in the
United States with a kind of deviant “un-American”
passive resignation. Further analysis
showed that the American students were
significantly higher than the Germans on n
Achievement, on participation in group ac- 
tivities, and in the extent to which they said 
they were guided by the opinions of others. 
The Germans for their part reported a greater 
number of individualistic outside activities 
and subscribed more to sentiments picturing 
the ego as independent of others. These results 
were interpreted to mean that in the United 
States obligation to self appears as a strong 
unconscious need for Achievement, and obli- 
gation to society as participation in many 
group activities, and that in Germany obli- 
gation to the self appears as a stress on the ego 
as a separate, unique, willing entity and obli- 
gation to society as a moral imperative to 
place obligation to the “impersonal other’ as 
an abstract code above particularistic, “selfish” 
considerations. An attempt is made to inter- 
pret German “authoritarianism” in terms of 
duty to such an abstract code rather than in 
terms of the more usual, somewhat unanalytical
and ethnocentric assumption that
Germans “like” to dominate and submit more 
than other people. 


REFERENCES 

1. Adorno, T. W., Frenkel-Brunswik, E., Levinson, D. J., & Sanford, R. N. The authoritarian personality. New York: Harper, 1949.

2. Bass, B. M. Authoritarianism or acquiescence? J. abnorm. soc. Psychol., 1955, 51, 616-623.

3. Brickner, R. Is Germany incurable? Philadelphia: J. B. Lippincott, 1943.

4. Cohn, T. S., & Carsch, H. Administration of the F-scale to a sample of Germans. J. abnorm. soc. Psychol., 1954, 49, 471.

5. Commager, H. S. America in perspective. New York: Random House, 1947.

6. deCharms, R., Morrison, H. W., Reitman, W. R., & McClelland, D. C. Behavioral correlates of directly and indirectly measured achievement motivation. In D. C. McClelland (Ed.), Studies in motivation. New York: Appleton-Century-Crofts, 1955.

7. De Tocqueville, A. Democracy in America. (1833). New York: Knopf, 1945.

8. Fromm, E. Hitler and the Nazi authoritarian character structure. In T. M. Newcomb & E. L. Hartley (Eds.), Readings in social psychology. New York: Holt, 1947.

9. Gorer, G. The American people. New York: Norton, 1948.

10. Kluckhohn, F. R. Dominant and substitute profiles of cultural orientations: Their significance for the analysis of social stratification. Social Forces, 1950, 28, 376-393.

11. Lewin, K. Some socio-psychological differences between the United States and Germany. Charact. and Pers., 1936, 4, 265-293.

12. McClelland, D. C., Atkinson, J. W., Clark, R. A., & Lowell, E. L. The achievement motive. New York: Appleton-Century-Crofts, 1953.

13. McGranahan, D. V. A comparison of social attitudes among American and German youth. J. abnorm. soc. Psychol., 1946, 41, 245-257.

14. McGranahan, D. V., & Wayne, I. German and American traits reflected in popular drama. Hum. Relat., 1948, 1, 429-455.

15. Riesman, D., with Glazer, N., & Denney, R. The lonely crowd. New York: Doubleday, 1953.

16. Schaffner, B. Fatherland: A study of authoritarianism in the German family. New York: Columbia Univer. Press, 1948.

17. Shipley, T. E., Jr., & Veroff, J. A projective measure of need for affiliation. J. exp. Psychol., 1952, 43, 349-356.


Received April 29, 1957 





SEX DIFFERENCES IN PROBLEM-SOLVING PERFORMANCE AS A 
FUNCTION OF ATTITUDE DIFFERENCES¹
GLORIA L. CAREY²


University of Nebraska 


PROBLEM solving is one of several areas
of intellectual functioning in which sex
differences in performance have been
noted (1, 3, 5, 6). Usually these differences
have been reported as incidental to the major
findings, and very few studies have been designed
specifically to test sex differences in
reasoning or problem-solving ability. Sweeney
(7), however, did perform a series of experiments
in which he controlled variables he
thought might be responsible for the previously
noted masculine superiority in problem
solving. Equating men and women on aptitude
(mathematical and verbal), visualization,
mechanical comprehension or mathematical
background did eliminate sex differences in
performance on certain types of problems, but
a striking number of highly significant sex
differences remained. Sweeney concluded that
these remaining differences occurred on problems
that required restructuring or involved a
general reasoning factor.

The present investigation began with the 
idea that some of the sex differences in problem
solving could be accounted for by differences
in nonintellectual factors. The question
that needed to be asked is: How well do women 
like to solve problems? If problem solving is an 
activity preferred by men and disliked by 
women, there exists a good possibility that sex 
differences in performance are actually a reflection
of sex differences in attitude.³


Such observations as the relative lack of
women in the scientific professions, feminine
expressions of inadequacy and dislike in problem-solving
experiments, and the cultural
stereotype of man as the “reasoning animal”
point toward a negative attitude toward problem
solving and the reasoning process, and
make the following hypothesis plausible: sex
differences in problem-solving performance
which are not the result of differences in general
intelligence, special aptitudes, or information,
are attributable to differences in attitude
toward problem solving.

In order to test this hypothesis, studies were
designed to answer the following specific
questions:

1. Can a scale be constructed that measures
attitude toward problem solving?

2. Is there a sex difference on such a scale?

3. Is problem-solving attitude related to
problem-solving performance?

4. Is an attempt to change problem-solving
attitude experimentally followed by a change in
performance?

5. Do women respond more favorably than
men to an attempt to improve their attitudes?
If women have an initially more unfavorable
attitude than do men, there should be more
opportunity for improving the attitude of
women than of men, and hence one might
expect a greater relative improvement in their
performance.

Affirmative answers to the first three
questions would give support to the major
hypothesis, but it was felt that such answers
to the fourth and fifth questions would provide
even more convincing evidence. In addition, a
scale throws little light on the question of
whether performance depends on attitude, or
whether the reverse is also true, that attitude
depends on performance. Therefore, the main
experiment was designed to answer Questions
4 and 5, which concern a deliberate manipulation
of the independent variable (attitude)
and measurement of its effect on the dependent
variable (performance).

¹ Condensed from a dissertation submitted in partial
fulfillment of the requirements for the degree of Doctor
of Philosophy, Stanford University, 1955. The writer is
indebted to Drs. D. W. Taylor, P. R. Farnsworth, R.
R. Sears, and Olga McNemar for their help and advice.
She also wishes to thank Drs. Yvonne W. Brackbill,
D. L. LaBerge, and R. J. Rhine for their assistance in
conducting experimental sessions. The investigation was
carried out under Project NR 150-149 supported by
Contract N6onr 25125 between Stanford University and
the Office of Naval Research. Permission is granted for
reproduction, translation, publication, use, and disposal
of this article in whole or in part by the United
States Government.

² Now at Swarthmore College.

³ By “attitude” the writer is referring simply to liking
or not liking to solve problems, or to the valence which
problem solving has for the individual. This does not
mean that it is possible to distinguish clearly between
attitude and other nonintellectual factors in problem-solving
success, such as motivation and interest.





METHOD

Attitude scale. The Likert scaling method was applied
to 63 statements which, in the opinion of several
investigators working on a problem-solving research
project, might be related to attitude toward problem
solving. (Example: “I would rather have someone tell
me the solution to a difficult problem than to have to
work it out for myself.”) There was no attempt deliberately
to construct items which would show a sex
difference in response. On the basis of an item analysis
of the responses of 32 men and 100 women in an elementary
psychology course at Stanford University, 36
items were selected that differentiated between high
and low scorers at the .05 level or better. The 36 items
were divided, on the basis of similarity of content and
[...], into two groups of 18 items each, which were
referred to as Forms A and B of the attitude scale. For
this first sample, the intercorrelation between the two
forms administered at the same time was .94. The intercorrelation
for a second sample of 59 women and 50
men was .83.


Attitude modification. In order to study the hypothesized
effect of attitude on performance, the following
experiment was designed so as to measure attitude and
performance before and after an attempt was [made to
change] attitude. Sixteen groups of six subjects
[...] together for a total of three hours,
[...] successive days. Each group
[consisted of three] men and three women, making
[...]. All Ss were members
[...] class at Stanford University
[...] them were sophomores.
[...] were given one form of the
attitude scale and a series of ten problems to solve,
[...] time allowed for completion.
The problems were ones for which a
[...] had appeared in an earlier experiment
even though the sexes had been matched
[...] thought to be related to
performance. The second hour consisted of an informal
discussion of some of the factors involved in
problem-solving success, with the implicit aim of
(a) conveying an appreciation of the relationship between
attitude toward problem solving and problem-solving
success and (b) encouraging each S to explore
and recognize his or her own attitude toward problem
solving. During the third hour, the alternate form of
the attitude scale and a second, comparable set of ten
problems were administered. Since the two sets of
problems were not exactly equal in difficulty, the order
of presentation was reversed for half the Ss. At no time
was a reference made to the real purpose of the experiment,
nor was the subject of sex differences ever introduced.
As far as the Ss were concerned, the first and
third hours of the experiment were used to standardize
the problems and scales, and the second (discussion)
hour was used to get their suggestions for future
problem-solving research.
There were four experimenters (Es), two male and
two female, each of whom worked with four groups of
six Ss each. They administered the attitude scales and
problems and led the group discussions. They also
observed two demonstration discussions by the writer
before leading discussions of their own. The use of four
Es allowed more generality to the conclusions than
would be possible with one E. Two Es of each sex also
provided an opportunity to test whether a significant
interaction occurs between sex of E and sex of S.

If a relatively greater improvement in performance
were obtained for women than for men, the possibility
would exist that women benefited more from the practice
involved in working the first set of problems than
did men. As a group, women may have had less experience
with the types of problems used and hence
may show a greater practice effect. For this reason,
control groups of 28 women and 20 men were used, the
procedure being the same as that for the experimental
Ss, with the exception that the discussion period was
eliminated for the control groups. In each control
group, the order of administration of the problems and
attitude scales was reversed for half the sample, as it
was for the experimental Ss.


RESULTS 


The results obtained from the attitude scale 
and from the attitude modification experiment 
are summarized with respect to each of the 
five questions raised earlier. 

1. Can a scale be constructed that measures
attitude toward problem solving? A reliable
measuring instrument was constructed. Some 
internal evidence of validity was provided by 
the fact that the items that referred directly to 
interest in or frequency of problem-solving ac- 
tivity were among the most highly differenti- 
ating items in the item analysis. Additional 
evidence of validity was provided by the fact 
that attitude scores and performance scores 
were positively related (see Question 3), which 
indicates that the people who say they like to 
solve problems or are interested in related activities
also solve the most problems. It would
be desirable to know the frequency with which 
Ss with high attitude scores actually seek or 
indulge in problem-solving behavior, but even 
without such a behavioral criterion of validity 
the contents of the scale and measures of 
validity cited above provide some evidence 
that the scale is actually measuring attitude 
toward problem solving. 

2. Is there a sex difference on such a scale? 
Men received significantly higher scores on the 
attitude scale than did women. In Table 1 is a 
summary of the data for the two samples 
used. 

3. Is problem-solving attitude related to prob- 
lem-solving performance? Correlations were 
computed between the attitude scale scores 
and problem-solving scores for men and women 
separately in both the prediscussion and the 
postdiscussion testing sessions. The eight cor- 
relations, seven of which were in the predicted 
direction, are given in Table 2. The fact that
only two were highly significant may have been
due in part to the small N for each subgroup.

TABLE 1
SEX DIFFERENCES IN ATTITUDE

Sample                Mean      SD

First Sample
  Form A
    Men               [...]
                      .01*
    Women             [...]
  Form B
    Men               [...]
    Women             [...]
Second Sample
  Form A
    Men               [...]
    Women             [...]
  Form B
    Men               [...] 5 10
                      .01*
    Women             [...] 8.48

* An asterisk in this and the following tables indicates that
a one-tailed test of significance was made, since the direction of
the difference was predicted.


TABLE 2
CORRELATIONS BETWEEN ATTITUDE AND PERFORMANCE
SCORES: EXPERIMENTAL GROUPS

Session and Sex      Attitude A and Problems I      Attitude B and Problems II

Prediscussion
  Men                24  [...]  .001*               [...]  13
  Women              24  .2  [...]  .08*            [...]
Postdiscussion
  Men                [...]  .001*                   [...]  .07*
  Women              24  .20  .18*                  [...]  .08*

4. Is an attempt to change problem-solving
attitude experimentally followed by a change in
performance? Performance data for the pre- and
postdiscussion testing sessions are given in
Table 3. A significant improvement in the per-
formance of women indicates that the question
may be answered in the affirmative for the 
female Ss. 

5. Do women respond more favorably than men
to an attempt to improve their attitudes? In order
to answer this question, a two-way analysis of
variance design was used to analyze the differ-
ence scores, with classification according to E
and sex of S.4 Table 4 is a summary of the
analysis of variance results. Difference scores
were calculated by subtracting each S’s prob-
lem-solving score for the prediscussion session
from that for the postdiscussion session. Since
the relative improvement of female Ss (mean
difference score of women minus mean dif-
ference score of men) was 1.25 and 1.22 for
one female E and one male E, respectively,
and -.08 and .22 for the other female E and
male E, respectively, there was obviously no
significant interaction between sex of E and
sex of S. Hence, the classification according to
sex of E was not used.

The sex difference in amount of improvement
in performance was significant, women re-
sponding more favorably than men to the at-
tempt to improve problem-solving attitude. An
additional finding was that the sex difference
in amount of improvement varied according to
E, a fact that indicates the need for using
several Es to yield results of some generality.

The change in performance of the control
groups was not significant for either sex (t =
.77 and .28 for men and women, respectively).
This finding indicates that the amount of
practice represented by the first session did not
result in a significant improvement in per-
formance.

As shown in Table 5, the improvement of the
experimental as compared with the control
groups was significantly larger for women than
for men. This finding further confirms the con-
clusion reached earlier on the basis of the
analysis in Table 4: women showed signifi-
cantly more improvement as a result of the
discussion than did men.

TABLE 3
DIFFERENCES IN PROBLEM-SOLVING SCORES BETWEEN
FIRST AND SECOND SESSIONS: EXPERIMENTAL GROUPS
[Table body not recoverable. Rows: Men, Women; legible column headings: First Session, Second Session, Diff, Mean, SD.]

TABLE 4
ANALYSIS OF VARIANCE OF PROBLEM-SOLVING
DIFFERENCE SCORES: EXPERIMENTAL GROUPS
[Table body only partly legible. Sources: Sex difference, Exper. difference, Interaction, Individual differences; legible column headings: Sum of Squares, df, Var Est.]

TABLE 5
DIFFERENCES BETWEEN MEAN DIFFERENCE SCORES
FOR EXPERIMENTAL AND CONTROL GROUPS
[Table body not recoverable. Rows: Men, Women, Difference between men and women; columns: Mean Difference for Experimental and Control groups.]

4 As a check on the consistency of the improvement
or lack of improvement for the women in the groups led
by the four Es, Bartlett’s test for homogeneity of
variance was applied to the variance estimates of the
four Es’ groups. Since chi square = 4.49 (p = .19), the
assumption of homogeneity of variance was not re-
jected. That is to say, the groups did not differ in
variability.
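
The difference-score measure used in this analysis (each S's postdiscussion score minus the prediscussion score, and the women's mean difference minus the men's) can be illustrated with a short Python sketch. The function names and the scores below are hypothetical and are not data from the study; the sketch shows only the arithmetic, not the analysis of variance.

```python
from statistics import mean


def difference_scores(pre, post):
    """Postdiscussion score minus prediscussion score for each subject."""
    return [after - before for before, after in zip(pre, post)]


def relative_improvement(women_pre, women_post, men_pre, men_post):
    """Mean difference score of women minus mean difference score of men."""
    women = mean(difference_scores(women_pre, women_post))
    men = mean(difference_scores(men_pre, men_post))
    return women - men


# Hypothetical scores for one discussion group (three women, three men).
women_pre, women_post = [3, 4, 5], [5, 6, 6]
men_pre, men_post = [5, 6, 4], [5, 7, 4]
print(round(relative_improvement(women_pre, women_post, men_pre, men_post), 2))
```

A positive value means that the women in the group improved more than the men; computing it separately for each E's groups gives figures comparable in form to those quoted in the text.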

A comparison was also made of the pre- and
postdiscussion attitude scores. Contrary to ex-
pectation, women did not show a relatively
greater change in attitude scores than did men,
nor did the scores improve significantly for
either sex following the discussion. Male and
female Ss in the control groups also showed no
significant improvement in attitude scores, and
a comparison of the change for experimental
versus control Ss revealed no significant
differences.
DISCUSSION 


The answers to the five questions support
the hypothesis that previously reported sex
differences in problem-solving performance are
attributable in part to sex differences in atti-
tude toward problem solving. An explanation
of this difference in attitude probably lies in
our culture, which still adheres to some extent
to the view that “Woman is the unreasoning
animal and pokes the fire from on top.”5 It is
hypothesized that men tend to be more favor-
ably disposed toward problem solving because
they have been encouraged in this attitude and
because acknowledgment of an unfavorable
attitude is a denial of one aspect of the mas-
culine role. Some support for this view is shown
in the relationships found between attitude
toward problem solving and mathematical and
verbal aptitude (2). The correlations for
women’s aptitude and attitude scores were
highly significant for both aptitude measures,
but there was no relationship between men’s
aptitude and attitude scores, a finding which
indicates that men tend not to admit to a dis-
like for problem solving even when they are
lacking in intellectual aptitude.

5 Quoted by William James, in a discussion of why
women are relatively poor at reasoning (4, p. 368).

Although a sex difference in improvement of 
problem-solving performance was the major 
prediction, it was also predicted that there 
would be a sex difference in improvement of 
attitude scores. The lack of any improvement 
in attitude scores warrants further consider- 
ation. 

One explanation for this finding is that the 
scale was not sensitive enough to reflect the 
changes in attitude that occurred, since a 
number of the items referred to interests and 
activities with respect to which Ss would have 
had no opportunity to alter their behavior be- 
fore the second testing session (e.g., “I enjoy 
being the score keeper when playing bridge or 
canasta’’). Another explanation for the lack of 
improvement in attitude scores is based on the 
occurrence of catharsis during the discussion. 
In this permissive situation Ss were encouraged 
to explore their attitudes toward problem 
solving—whether positive or negative—with 
the result that they felt quite free to express 
negative attitudes. The cathartic effect of 
discussion may have been such as to result in a 
freer, more realistic expression of attitude. 

A final possibility, of course, is that the discus- 
sion did not modify attitude and that other 
factors accounted for the improvement in 
women’s performance. Practice in solving the 
first set of problems might result in such an 
improvement if women were more affected by 
the practice than were men, but this factor was 
controlled by the use of female Ss who had a 
similar exposure to the problems without the 
intervening discussion. The fact that the con- 
trol Ss did not improve in performance suggests 
that the effect of solving the first set of prob- 
lems was to counterbalance the positive effects 
of practice by reinforcement of the existing 
negative attitudes. An examination of the dis- 
cussion procedure indicates that motivations 
such as sex rivalry, increased level of aspira- 
tion, or a desire to please the discussion leader 
do not appear to account for the improvement 
in women’s performance. At no time did Ss 
in the discussion groups comment on sex differ- 
ences in problem-solving performance, and the 
Es were under instructions not to refer to such 
differences or to the real purpose of the discus-
sion. In fact, Ss did not know that they would
be given more problems to solve after the discus- 
sion session, although the remark was occasion- 
ally made at the end of the discussion that if 
Ss had a chance to work on more problems, 
they thought they would do a better job of 
solving them. No scores or answers were given 
out during the experiment so that change in 
level of aspiration is also not a very likely pos- 
sibility. By the use of both male and female 
Es, interaction between sex of E and sex of S 
could be determined, but the results indicated 
that such interaction did not occur. Since Es 
made no attempt to exhort Ss to improve their 
attitudes or performance, it is unlikely that 
female Ss improved in performance because 
they were motivated to please E, although this
possibility cannot be ruled out entirely. 

Consideration of possible alternative ex- 
planations for the significantly greater im- 
provement in women’s performance leaves 
change in attitude as a tenable one even though 
the attitude scale itself was probably not 
sensitive enough to reflect a change. 

A final comment may be made on the fact 
that there were marked differences among Es 
in the relative improvement of female Ss. The 
E difference was not due to an interaction be-
tween sex of E and sex of S, and the discussion
leaders were of approximately the same age 
and level of academic attainment. Since the 
discussion procedure was also fairly well 
standardized from one group leader to another, 
it would appear that the differences were re- 
lated to personality characteristics of the Es. 


SUMMARY 


The study was designed to test the hypothe- 
sis that sex differences in problem-solving per- 
formance which are not the result of differences 
in general intelligence, special aptitudes, or in-
formation, are attributable to differences in
attitude toward problem solving. First, a
scale designed to measure attitude toward
problem solving was constructed. Then the
attitude scale and a series of ten problems were 
administered to experimental groups consisting 
of three men and three women each. The 
group members then participated in a discus- 
sion, with the disguised intent of promoting a 
more favorable attitude toward problem solv- 
ing. In a postdiscussion session, Ss were ad- 
ministered an alternate form of the attitude 
scale and a second set of problems, comparable 
to those used in the prediscussion session. A
control group that did not participate in the 
discussion was also used. 

The data were analyzed according to their 
bearing on five questions related to the major 
hypothesis. 1. Can a scale be constructed that 
measures attitude toward problem solving? 2. 
Is there a sex difference on such a scale? 3. Is 
problem-solving attitude related to problem- 
solving performance? 4. Is an attempt to 
change problem-solving attitude experimen- 
tally followed by a change in performance? 
5. Do women respond more favorably than 
men to an attempt to improve their attitudes? 

The attitude scale gave some internal evi- 
dence of validity, and men were found to have 
significantly higher scores on the scale than 
women. Also, attitude scores were found to 
have some positive relationship to performance 
scores. After the group discussion, designed to 
induce a more favorable attitude toward prob- 
lem solving, women showed a significant im- 
provement in performance, while men showed 
no change. An examination of some possible 
ways of accounting for the relatively greater 
improvement of women’s postdiscussion per- 
formance indicated that improvement of 
attitude is at least a tenable one. In general, 
the findings support the hypothesis of sex 
differences in problem solving as a function of 
sex differences in attitude. An explanation of 
such attitudinal differences in terms of cultural 
differences for men and women is offered.

PERSONALITY AND IQ CHANGE1

RESEARCH on mental development during
the last twenty years has indicated
that a child’s IQ score does not neces-
sarily remain constant with age (2, 3, 4, 10).
Several reports (9, 10, 12) suggest that changes 
in environmental conditions can depress or 
raise IQ level and it is sometimes implied that 
these changes may be explained by recourse to 
personality variables. The purpose of this 
paper is to demonstrate that changes in IQ 
during childhood are correlated with certain 
personality predispositions as inferred from 
projective test data. The personality variables 
under study include (a) need for achievement, 
(b) competitive strivings, (c) curiosity about
nature, and (d) passivity. 

Performance on an IQ test is assumed to be 
a function of at least two major variables; the 
variety of skills and abilities the person brings 
to the test situation and his motivation to per- 
form well on the test (2, 6). Since the IQ scores 
of some children change markedly during the 
school years, it seems plausible to assume that 
those children who show marked increases in 
IQ have a very strong motivation to acquire or 
develop the various intellectual skills tapped 
by an IQ test and to perform well in a testing
situation. It is suggested that need for achieve- 
ment, competitive strivings, and curiosity 
about nature motivate the acquisition and im- 
provement of cognitive abilities and by so doing 
facilitate increases in tested IQ. 

The social environment often awards praise 
and recognition for intellectual accomplish- 
ment, and school age children with a high need 
for achievement might seek to gratify this need 
through intellectual activity. Thus it was pre- 
dicted that children showing marked increases 
in IQ would produce more achievement 
imagery on the TAT than those with minimal 
gains in IQ. 

Secondly, the school environment empha-
sizes competitive intellectual activity, and
children with strong competitive needs would
be highly motivated to acquire the intellectual
skills which result in successful competition
with one’s classmates. Thus it was predicted
that children showing IQ gains would show
more competitive strivings than children dis-
playing minimal gains in IQ. In choosing an
index of competitive strivings, besides the re-
lated measure of TAT achievement fantasy, it
was decided to use aggressive content on the
Rorschach. The bases for this choice rested on
the assumptions that (a) incidence of aggres-
sive imagery reflected degree of aggressive
motivation and (b) competition was a socially
accepted form of aggressive behavior. For in
competition, as in aggression, the child desires
to defeat another individual and assert his
superiority over him. The population of chil-
dren in this study is predominantly middle
class and apt to place strong inhibitions on
direct, overt expression of aggression. There-
fore, there would be a tendency for the indi-
vidual with high aggressive motivation to seek
socially accepted channels for aggressive ex-
pression such as competitive activity with
peers. Thus it was predicted that children
showing IQ gain would report more Rorschach
aggressive content than those with minimal
gain because of their greater competitive pre-
disposition.

1 This investigation was supported in part by a
research grant (PHS M 1260) from the National In-
stitute of Mental Health of the National Institutes of
Health, United States Public Health Service. The
writers wish to thank Dr. Seymour B. Sarason for his
critical reading of the manuscript.

A third motive that might facilitate a child’s 
acquisition of knowledge and skills in dealing 
with the environment could be curiosity about 
nature. Interest in birth, death, sexual anat- 
omy, and other processes of nature is a fre- 
quent phenomenon in young children. It is sug- 
gested that the more intense this curiosity the 
greater the motivation to acquire the habits 
which would gratify this motive. Since reading, 
questioning, and manipulating the environ- 
ment are effective behavioral methods of 
gratifying one’s curiosity, it might be expected 
that the highly curious child would be more 
likely to develop these skills and therefore apt 
to gain in IQ score. The TAT measure used to 
evaluate curiosity was presence of themes of 
interest in nature and its phenomena. For the
Rorschach, it was hypothesized that concern 
with the body might reflect, in part, heightened 
interest in natural processes, and it was sug- 
gested that anatomy content might be more 
frequent for children who showed marked IQ 
gains than for those with minimal increases in 
IQ. It is recognized that many clinical psy- 
chologists regard anatomy content in adults as 
indicative of psychopathology. This study is 
concerned with the correlates of IQ gain rather 
than psychopathology, and it is not implied 
that children who show increases in IQ are 
completely free of conflict. Secondly, it was felt 
that the determinants of anatomy content for 
children might be different from those which 
produce this content in adults. 

A final prediction dealt with the predisposi- 
tion to behavioral passivity. The children who 
show IQ gains have been characterized as
having high need achievement, competitive 
strivings, and curiosity about the environment. 
This constellation of motives implies that when 
these children are confronted with a problem, 
they would have a tendency to attack and 
attempt to solve the problem rather than with- 
draw from the situation or seek help. On this 
basis, it was predicted that children who 
showed IQ gains would be less likely than those 
with minimal IQ increases to characterize their 
TAT heroes as passive in attitude or behavior. 

The Fels Research Institute is uniquely 
equipped to test these ideas about IQ change
since it has continuous longitudinal information 
on the development of a sample of normal 
children. These data include intelligence and 
projective tests, observations of the children, 
and reports on the parent-child interaction. In 
a recent study, Sontag, Baker, and Nelson (11) 
related personality information on a sample of 
children with changes in IQ and found that 
those children who showed marked increases in 
IQ were rated as more competitive, more likely 
to display self-initiated behavior and less pas- 
sive than those who showed decreases in IQ. 
The TAT and Rorschach protocols were not 
utilized in making these personality ratings, and 
the results from this study served as a major 
stimulus for the present investigation. 


METHOD 


A sample of 140 Fels subjects (Ss), 70 of each sex,
was chosen for study because a fairly complete record
of test information was available on them. From ages
2½ to 6, the Stanford-Binet intelligence test (1916 or
1937 revision) was administered to most Ss twice
yearly, on their birthdays and six months after their
birthdays. From ages 6 to 12, most Ss received alter-
nately Form L or Form M of the 1937 revision annually
on or near each S’s birthday. All of the tests were ad-
ministered by one of the authors (VLN). The mean IQ
of the Fels population is near 120, with standard devia-
tion varying from 14 to 20 IQ points.

In order to obtain groups of Ss who showed the most
change in IQ score from ages 6 to 10, a smoothed
longitudinal plot of each S’s IQ was prepared by averag-
ing the mean of three consecutive test scores around
each age. This procedure is explained in detail in other
reports (1, 10, 11). This technique tends to eliminate
erratic variations in IQ and hopefully furnishes a more
valid measure of IQ changes. Then each S’s smoothed
IQ at age 6 was subtracted from his smoothed IQ at age
10, and this distribution of differences, positive if S
gained in IQ and negative if S lost in IQ, was divided
into quartiles. This report deals with the projective
test information on those Ss in the two extreme groups:
those who increased and those who decreased the most
in IQ score. These will be called Group A, the IQ as-
cenders, and Group D, the IQ descenders, respectively.
There was no significant difference between the mean
IQ of the two extreme quartiles at age six, the means
being 119 and 116 for Groups A and D respectively.
The average amount of increase in IQ for Group A
was larger (plus 17 points) than the corresponding de-
crease for the members of Group D (minus 5 points)
and while 46 per cent of Group D lost five or more
points, every child in Group A gained 10 or more points
during the years 6 through 10. The mean IQ of the
entire sample of 140 tends to increase slightly from ages
6 to 10, probably as a result of practice effects with the
same test. Since every S in Group D showed a decrease
in IQ, it might be inferred that the members of Group
D did not benefit from practice and familiarity with the
test, and it is probably more accurate to view Group
D Ss in this light rather than as Ss who showed marked
decreases in IQ score.
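
The grouping procedure just described can be illustrated with a short Python sketch. It is an illustration only, under stated assumptions: one test score per year of age, a smoothed value taken as the mean of the three consecutive scores centred on an age, and made-up scores. The names `smoothed_iq`, `iq_change`, and `extreme_quartiles` are hypothetical; the exact smoothing rule is given in the reports cited above.

```python
from statistics import mean


def smoothed_iq(scores_by_age, age):
    """Mean of the three consecutive scores centred on `age` (assumed rule)."""
    return mean(scores_by_age[a] for a in (age - 1, age, age + 1))


def iq_change(scores_by_age):
    """Smoothed IQ at age 10 minus smoothed IQ at age 6."""
    return smoothed_iq(scores_by_age, 10) - smoothed_iq(scores_by_age, 6)


def extreme_quartiles(changes):
    """Return (ascenders, descenders): top and bottom quarter by IQ change."""
    ranked = sorted(changes, key=changes.get)  # subject ids, lowest change first
    q = len(ranked) // 4
    if q == 0:
        raise ValueError("need at least four subjects to form quartiles")
    return ranked[-q:], ranked[:q]


# Made-up annual scores (ages 5 to 11) for eight hypothetical children.
scores = {
    f"child{i}": {age: 115 + slope * (age - 6) for age in range(5, 12)}
    for i, slope in enumerate([-2, -1, 0, 1, 2, 3, 4, 5])
}
changes = {child: iq_change(s) for child, s in scores.items()}
group_a, group_d = extreme_quartiles(changes)
print("Group A (ascenders):", group_a)
print("Group D (descenders):", group_d)
```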

The projective tests used in the analysis were the
Rorschach and selected TAT pictures. Two factors
governed the choice of the TAT cards which were
analyzed. Because the protocols were gathered over a
period of years, there was not complete comparability
for all Ss for the number of cards administered. Sec-
ondly, the specific hypotheses of the study dictated the
cards chosen for analysis and Cards 1, 3 BM, 3 GF, 5,
6 BM, 12 F, 14, and 17 BM were selected for analysis.
The age at which the TAT protocols were administered
ranged from 8-9 to 14-6 with median at 11-6 and 80
per cent of the protocols obtained between the ages of
11 and 12. The age at which the Rorschachs were ad-
ministered ranged from 6-5 to 13-6 with median at
10-5 and 63 per cent of the sample having had the test
between ages 10 and 11. Since the Rorschach and TAT
were administered by different examiners there was no
comparability with respect to inquiry or probing. Thus,
the analysis of both the Rorschach and TAT was re-
stricted to the S’s spontaneous verbalization to the
stimulus before any questions or inquiry were conducted
by the examiner. The protocols were scored for the
following fantasy categories:
1. Need achievement on the TAT. Achievement
imagery on the TAT was scored according to the
definition of McClelland et al. (8); and themes involv-
ing a reference to competition with a standard of
excellence were scored achievement imagery.

2. Rorschach aggression. The definition of aggressive
content on the Rorschach included (a) people, animals,
[...]ures engaged in physical or verbal aggression,
e.g., fighting or quarreling, (b) explosive objects or ex-
plosions, e.g., volcanoes, bombs exploding, fireworks,
and (c) objects or animal parts normally regarded as
instruments of aggression, e.g., spears, rifles, clubs,
[...], knives, horns, and claws.

3. Intellectual curiosity about nature. For the TAT,
curiosity was defined in terms of themes in which some-
one is interested in the processes or phenomena of
nature. Curiosity on the Rorschach was restricted to
anatomy or X-ray responses of internal organs or
boney parts, e.g., stomach, backbone, ribs.

4. Passivity. Because of the limited amount of
thematic material in the spontaneous performance,
themes of passivity were limited to stories in which the
central figure was described as sleepy, tired, or resting.

The fantasy categories were independently scored by
the senior author and an assistant without knowledge
of the S’s IQ scores.2 Reliability was very high because
of the limited amount of content scored for each re-
sponse and the objectivity of the definitions. Percentage
of agreement for the three TAT categories was 95 per
cent and for the two Rorschach categories 99 per cent.
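
The reliability figure reported here is a simple percentage of agreement between two scorers. As a minimal sketch, assuming each scorer records one present/absent judgment per response (the function name and the judgments below are hypothetical, not the study's data):

```python
def percent_agreement(scorer_a, scorer_b):
    """Percentage of responses on which two scorers gave the same judgment."""
    if len(scorer_a) != len(scorer_b):
        raise ValueError("both scorers must judge the same responses")
    if not scorer_a:
        raise ValueError("no judgments to compare")
    matches = sum(a == b for a, b in zip(scorer_a, scorer_b))
    return 100.0 * matches / len(scorer_a)


# Hypothetical judgments (1 = category present, 0 = absent) on four stories.
senior_author = [1, 0, 1, 1]
assistant = [1, 0, 0, 1]
print(percent_agreement(senior_author, assistant))
```

Percentage of agreement does not correct for agreement expected by chance, which is one reason it tends to run high when a category is scored infrequently.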

RESULTS 

Although there was a total of 70 Ss in the
two extreme quartiles, not all of the Ss had
Rorschach or TAT data for the age range under
study. Table 1 shows the distribution of Ss, by
sex and direction of IQ change, for the TAT
and Rorschach analyses. Because there are
approximately twice as many boys as there are
girls in Group A, all comparisons were first
made separately by sex and results were only
combined if the direction of the result for both
boys and girls in the same IQ group was in the
predicted direction.

1. Need achievement. All achievement
themes, save one, occurred to Cards 1 and 17
BM. The typical achievement story to Card 1
concerned a boy who wanted to master the
violin and/or become a famous violinist, while
the typical achievement theme to 17 BM in-
volved competitive activity with regard to
rope climbing. Table 2 shows the percentage of
Ss in each group reporting achievement im-
agery plots to Cards 1, 17 BM, and to both
pictures.

For both Cards 1 and 17 BM, more male and
female Ss in Group A report achievement
imagery than the boys or girls of Group D. For
Card 1, the difference between Group A and
Group D girls is reliable at the .03 level; the
difference for boys is in the predicted direction
but not significant. For Card 17 BM, the differ-
ence between Group A and Group D boys is
significant (P = .03) and in the predicted di-
rection for girls. All P values are for one tail
and were evaluated using the exact method
suggested by Fisher (5). When the sexes were
pooled, comparisons between Groups A and D
were significant not only for Cards 1 and 17 BM
separately but also for the number of Ss telling
achievement imagery to both Cards 1 and 17
BM (P < .10, .03, and .01 respectively). Thus,
the Ss who showed increases in IQ were more
prone to structure Cards 1 and 17 BM in
terms of achievement oriented behavior than
the Ss in Group D.

TABLE 1
DISTRIBUTION OF Ss BY SEX AND DIRECTION OF IQ
CHANGE USED IN THE ANALYSIS OF THE
TAT AND RORSCHACH

                     TAT            Rorschach
Group            Boys   Girls     Boys   Girls
Group A           22     10        22     10
Group D           11     20         9     18
Both groups       33     30        31     28

TABLE 2
PERCENTAGE OF Ss REPORTING ACHIEVEMENT
IMAGERY TO CARDS 1 AND 17 BM

                         Group A                   Group D
                   Boys  Girls  Boys and     Boys  Girls  Boys and
                                 Girls                     Girls
Card 1             36.4  50.0    40.6        27.3  15.0    19.4
Card 17 BM         36.4  30.0    34.4         0.0  15.0     9.7
Cards 1 and 17 BM  22.7  10.0    18.8         0.0   0.0     0.0

2 The writers wish to thank Mary Schnurer for her
assistance in assessing the reliability of the scoring.

2. Aggressive content on Rorschach. There 
was no significant difference between Groups 
A and D or between boys and girls with respect 
to the mean number of responses per protocol, 
and the mean for the entire sample was 27 
responses. There was no difference between 
Group A and Group D girls with respect to 
percentage of each group reporting one or more 
aggressive responses per protocol (30.0 per cent 
for Group A versus 33.0 per cent for Group D). 
However, the difference between Group A and 
D boys approached significance with 59.1 per 
cent of the former and 22.2 per cent of the latter
reporting one or more aggressive images 
(P = .07). Thus, the prediction of a correla- 
tion between IQ increase and aggressive 
imagery held only for the boys. Because of the 
tentativeness of this result and the more 
speculative nature of the hypothesis relating 
competitive striving and aggressive content, an 
attempt was made to validate this finding by 
analyzing a later Rorschach protocol for the 
boys in Groups A and D. Not all of the boys 
had Rorschachs administered to them at a 
later age, and only 15 Ss in Group A and five 
in Group D were available for analysis. The 
median ages at the time of administration 
were 13-8 and 15-0 for Groups A and D re- 
spectively, and there was no significant differ- 
ence in the lengths of the protocols of the two 
groups. The results were in the same direction:
86.7 per cent of Group A and 20.0 per cent
of Group D reported one or more aggressive
images, and this difference is highly significant
(P = .01). 
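
The one-tailed exact probabilities quoted in this section follow Fisher's method for a 2 x 2 table. The Python sketch below shows how such a probability can be computed from the hypergeometric distribution. The counts in the example (13 of 15 boys in Group A and 1 of 5 in Group D) are an inference from the percentages given above for the later protocols, not figures stated in the report, and the function name is hypothetical.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed exact probability for the 2 x 2 table [[a, b], [c, d]].

    Returns the probability, with all margins fixed, of a count in the
    upper-left cell at least as large as the observed `a`.
    """
    row1 = a + b           # size of the first group
    col1 = a + c           # total number showing the characteristic
    n = a + b + c + d      # total number of subjects
    total = comb(n, col1)  # ways of choosing which subjects show it
    tail = 0
    for k in range(a, min(row1, col1) + 1):
        tail += comb(row1, k) * comb(n - row1, col1 - k)
    return tail / total


# Inferred counts: aggressive content present or absent, Group A versus Group D.
p = fisher_one_tailed(13, 2, 1, 4)
print(round(p, 3))
```

With these inferred counts the sketch gives a probability of about .014, which is of the same order as the value reported above.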

3. Intellectual curiosity. The only TAT card 
eliciting curiosity plots was Card 14, and the 
typical theme described a person gazing at or 
interested in the stars or the heavens. Table 3 
shows the percentage of each group telling such 
themes to Card 14. 

Both the boys and girls in Group A told 
more themes of interest in the stars or heavens 
than the males and females in Group D 
(P = .14, P = .10, respectively) and combining 
of the sexes yielded a highly significant differ- 
ence between Groups A and D (P < .01). 

4. Anatomy and X-ray responses on the
Rorschach. There was no difference between
Group A and Group D girls reporting one or
more anatomy responses (30.0 per cent versus
38.9 per cent for Groups A and D respectively).
For the boys, 31.8 per cent of Group A and 0.0
per cent of Group D reported anatomy or
X-ray imagery, a difference that approached
significance (P = .06). This finding was also
validated on the same sample of 20 boys that
was used to check the differences in aggressive
content. The results were in the same direction
with 60.0 per cent of Group A and 20.0 per cent
of Group D reporting anatomy content
(P = .15).

TABLE 3
PERCENTAGE OF Ss REPORTING THEMES OF CURIOSITY
TO CARD 14

Sex               Group A   Group D
Boys               40.9     [illegible]
Girls              30.0     [illegible]
Boys and girls     37.5     [illegible]

TABLE 4
PERCENTAGE OF Ss REPORTING THEMES OF PASSIVITY
TO CARD 3 BM

Sex               Group A   Group D
Boys                9.1      27.3
Girls              10.0      45.0
Boys and girls      9.4      38.7

5. Passivity. Card 3 BM accounted for most
of the passivity themes and the groups were 
compared with respect to the incidence of 
stories to Card 3 BM in which the central 
figure was sleepy, tired, or resting. Table 4 
shows the percentage of each group telling such 
themes. Both the boys and girls in Group D 
showed more passivity themes than the boys 
and girls in Group A. Although only the differ- 
ence for the girls was significant (P = .06), 
when the sexes were pooled the difference was 
highly reliable (P < .03). 

Cards 3 GF, 5, 6 BM, and 12 F did not furnish
data relevant to the hypotheses under test 
and these results are not summarized. 

DISCUSSION

In the main, the hypotheses about the differ- 
ences between Groups A and D have been 
verified. Boy and girl ascenders produced more 
TAT achievement imagery and curiosity about 
nature than Group D children and male 
ascenders displayed more aggressive content on 
the Rorschach than the boys in Group D. The 
higher incidence of aggressive imagery for the 
boys who gained in IQ was interpreted as re- 
flecting stronger competitive motivation. 
Finally, the Ss in Group D were presumed to 
have a more passive orientation since they 
were more likely to perceive the ambiguous 
figure on Card 3 BM as sleeping or tired. The 
relation between Rorschach anatomy content 
and IQ gain was the most tentative finding. 

The results are interpreted as indicating 
that high motivation to achieve, competitive 
strivings, and curiosity about nature may moti- 
vate the acquisition of intellectual skills and 
knowledge which, in turn, facilitates increases 
in tested IQ. If one accepts the generally 
assumed notion that boys are more competitive 
and achievement oriented than girls, the fact
that there were twice as many boys in Group A 
as there were girls supports the present inter- 
pretation. A recent study using the Edwards 
Personal Preference Schedule found that high 
school boys obtained higher need achievement 
scores than high school girls (7). 

These results are not interpreted as indi-
cating that strong achievement, competitive,
and curiosity motives are the only variables
involved in producing gains in IQ. The Ss in
this study are all average or above in IQ and 
there is not adequate sampling of children 
with lower IQ levels. One would not expect Ss 
with low IQs or language handicaps to sud- 
denly show an interest in reading despite 
achievement needs or intellectual curiosity. 
The child who spends increased time reading 
because of a heightened interest in natural 
processes must have already learned the basic 
reading skills so that this behavior is not a 
difficult or unlikely choice for him. 

Similarly, needs for achievement and success- 
ful competition should only motivate attempts 
at improvement of intellectual abilities in a 
social milieu where praise, recognition, and 
superior status are awarded for such accom- 
plishment. That is, achievement-oriented 
children from homes in which intellectual ac- 
tivity was praised would probably be more 
likely to master intellectual skills than achieve- 
ment-oriented children from homes in which 
such accomplishment was not rewarded. In a 
cultural environment where athletic ability, 
fighting prowess, or success with the opposite 
sex was highly valued, one might expect the 
child to choose these behavioral channels to 
gratify his achievement and competitive needs. 
The parents in the Fels population are pre- 
dominantly middle class and tend to place im- 
portance on intellectual accomplishment. A 
large majority of the parents have attended 
college, and since enrollment in the Fels pro- 
gram is voluntary it might be inferred that 
only parents who valued knowledge and
scientific pursuits would be predisposed to become
part of the research population. Thus, the
children under study tend to come from homes
which value intellectual ability.

Study of the educational attainment of the
parents of the Ss in Groups A and D revealed
no significant difference between the groups
with respect to the percentage of families in
which both parents attended college (57.1 per
cent for Group A versus 42.9 per cent for
Group D; P > .30). Although there is a slight
difference favoring the educational level of
Group A families, the difference was not 
dramatic. There may be important differences 
between Groups A and D with respect to the 
differential encouragement of intellectual 
achievement, but measurement of these differences
would probably require variables more
refined than educational level of the parents. 
However, even though parental emphasis on 
intellectual activity may increase the child’s 
desire to improve his cognitive skills, the 
child’s predisposition to adopt or rebel against
parental values should selectively influence his 
motivation to strive for intellectual accom- 
plishment. Thus, the type of relation between 
parent and child may be an important factor in 
this process. 

Finally, there is the possibility that genetic 
and/or constitutional variables may play a 
role in facilitating marked IQ changes. There 
is considerable data indicating that genetic 
factors influence general IQ level but less evi- 
dence relevant to the role of these variables in 
producing childhood increases in IQ score. For 
most of the children in our population, IQs 
tend to level off during the ages 6-10 and most 
of the marked changes in level occur during the 
preschool years. However, the exact relation- 
ship between genetic variables and IQ change 
has yet to be determined. The phenomenon of 
IQ increase during the school years is admittedly
complex and it is not implied that the
child’s motives are the major factor. However, 
it is suggested that personality needs may in- 
fluence this process. Perhaps the most accurate 
generalization is that for middle-class children 
with average or above IQ levels, strong achieve- 
ment, competitive, and curiosity needs may 
facilitate IQ gains by motivating the child to 
master intellectual skills. 

A final implication of these findings is that 
they add indirect evidence for the usefulness 
of the Rorschach and TAT as research instru- 
ments. Validation of a predicted relationship 
between TAT achievement imagery and IQ 
gain increases one’s confidence in the hypothe- 
sis that TAT plots can serve as an index of 
achievement-oriented tendencies. The results 
of the Rorschach analysis suggest that aggressive
content may be an index of an individual’s
aggressive predispositions but not necessarily a 
measure of his tendency to express direct, 
physical aggression. Although Sontag, Baker,
and Nelson (11), using behavioral observa- 
tions, rated the boys in Group A as more 
competitive than those in Group D, there was 
no difference between these groups with re- 
spect to intensity or incidence of direct verbal 
or physical aggression or destruction of 
property. We have assumed that competition 
is a socially approved form of aggressive be- 
havior and the higher incidence of aggressive 
content for Group A boys was presumed to be a 
result of their more intense competitive striv- 
ings. Some clinicians who use projective tests 
are too prone to focus on predictive statements 
about direct, physical aggression when con- 
fronted with a protocol containing aggressive 
content. One is apt to overlook the fact that 
the individual may have alternative behavioral 
channels for expression of aggressive motives. 


SUMMARY 


For a group of 140 boys and girls in the Fels 
Research population on whom continuous 
Binet IQ data were available, a distribution of 
IQ change was obtained by subtracting each 
S’s smoothed IQ at age 6 from his smoothed IQ 
at age 10. This distribution of differences was 
divided into quartiles, and the Rorschach and 
TAT protocols of the upper (maximum increase
in IQ) and lower (maximum decrease in
IQ) quartiles were analyzed and compared. The
results showed that in comparing the Ss who
showed IQ increases with those showing IQ
decreases, the former had, on the TAT, significantly
more (a) achievement imagery on Cards
1 and 17 BM and (b) themes of curiosity about
nature on Card 14, and significantly fewer
themes of passivity on Card 3 BM. For the
boys only, more of the Ss who increased in IQ
had anatomy responses and aggressive imagery
on the Rorschach. The results were
interpreted as indicating that high need
achievement, competitive striving, and curiosity
about nature are correlated with gains in
IQ score because they may facilitate the acquisition
of skills that are measured by the
intelligence test.
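
The selection procedure summarized above (a change score equal to smoothed IQ at age 10 minus smoothed IQ at age 6, followed by comparison of the extreme quartiles) can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the record layout, and the example scores are assumptions made for the illustration, not data or procedures taken from the study, and ties at a quartile boundary are resolved arbitrarily by sort order.

```python
def iq_change(iq_age6, iq_age10):
    """Change score: smoothed IQ at age 10 minus smoothed IQ at age 6."""
    return iq_age10 - iq_age6


def extreme_quartiles(records):
    """Split subjects into the top and bottom quarters by IQ change.

    records: list of (subject_id, smoothed_iq_age6, smoothed_iq_age10).
    Returns (ascenders, descenders), the analogues of Groups A and D.
    """
    # Rank subjects from largest decrease to largest increase.
    ranked = sorted(records, key=lambda r: iq_change(r[1], r[2]))
    k = len(ranked) // 4  # number of subjects in each extreme quartile
    descenders = [r[0] for r in ranked[:k]]
    ascenders = [r[0] for r in ranked[len(ranked) - k:]]
    return ascenders, descenders


# Hypothetical scores, invented purely for illustration.
example = [
    ("s1", 100, 110), ("s2", 100, 90), ("s3", 100, 101), ("s4", 100, 99),
    ("s5", 100, 105), ("s6", 100, 95), ("s7", 100, 120), ("s8", 100, 80),
]
group_a, group_d = extreme_quartiles(example)
```

With 140 subjects, as in the summary, each extreme group under this sketch would hold 35 children; the middle two quartiles are simply left out of the comparison.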

Who smokes? Who doesn’t smoke?
Who smokes too much? Who can
stop smoking? Reviewing the sparse
literature on smoking and personality, we 
could find neither descriptive data answering 
these questions nor good theory from which the 
answers might be deduced. Therefore, we have 
made a first attempt at delineating a psychol- 
ogy of smoking. 

Our basic data come from The Study of 
Adult Development, once known as the Grant 
Study. That study has been described else- 
where (6). Our subjects were a panel of 252 
Harvard alumni who were selected during 
their sophomore years for lack of visible 
abnormality. During their sophomore years, 
which fell between 1938 and 1942, they were 
studied by a great range of medical, physiological,
psychological, anthropological, and
sociological techniques. We have been following
these men since by annual questionnaire,
retesting, and visits. Questions about smoking
habits were routinely asked in the questionnaires
during the fifteen or so years the men
have been followed. This very full material 
permitted the study staff to make comparisons 
between smoking habits and hundreds of 
variables. Only the findings relevant to psychosocial
theory are reported here.²


WHO DOESN’T SMOKE?

A striking number of our subjects did not
smoke. When the men were first studied, as
sophomores, 108 of them (some 45 per cent)
had not begun to smoke at all. Sixty-one never
have. That is about 24 per cent of the group.

² Some of the other findings of the study staff,
especially those in relation to medical and physical

In any one year, more than 40 per cent of the
men will be “off” smoking, though for some 
this is very temporary. 

Our theory begins with Bales’ very general 
propositions about the origins of any com- 
pulsive habit. Bales (2) began by studying 
compulsive drinkers. He found that “analysis
shows the craving to be a result of at least two 
types of underlying elements: (1) some need 
or complex of needs. . . and (2) an orientating 
structure of habitual thought patterns and 
associated emotional justifications.” Orienta- 
tion has to come first; if the experience is 
found gratifying, it may be put into the serv- 
ice of needs. As Bales (3) says, “it is during 
the formation of the compulsive habit that the 
influence of the society and its culture come to 
a critical focus in the individual person and 
work their effect.” In particular, it is society 
that determines whether the new habit seems 
to be “a means of relieving . . . inner tensions,
or whether such a thought arouses a strong 
counteranxiety.” 

Convergent theories of anthropologists,
sociologists, and psychologists, reviewed and
empirically confirmed elsewhere (10, 13, 14, 15),
all suggest that the use of tobacco to resolve
major personal needs might “arouse a strong
counteranxiety” in upwardly mobile members
of America’s lower middle class. It seems likely
that this group differs in their attitude toward
smoking from people who stand both above
and below them in the social hierarchy. Their
attitude to smoking would, presumably, be a
corollary of a basic value system that these
people hold in contrast to most of contemporary
society.

The source of this value system is described 
by Weber (23) as the Protestant Ethic. It is a 
work morality, emphasizing the Devil’s stake 
in idleness and self-indulgence. It is an ethic 
that abhors as Sin the wasting of one’s sub- 
stance in unprofitable trivia, since such waste, 
in the original Protestant dogmas, was evidence
that a man had not been vouchsafed
Grace. This ethic was, as Mills (18) remarks, a 
morality of producers, who abhorred consum- 
ers. It is typical that, after one especially long 
Calvinistic debate, tobacco was settled upon as
a good thing because the growing of it produced 
profit. 

This theology of “provident purposes” was 
soon secularized in the form of “worldly 
asceticism,” as Weber illustrates from the 
diaries of Benjamin Franklin. Indeed, the 
process of lay morality descending from reli- 
gious dogma is still going on, even among the 
Harvard group we are studying, as Allport (1) 
has shown in his study of religion in the post- 
war college student. It is thus that the tradi- 
tional middle-class values were generated. 

These values are now being replaced in our 
society, according to sociologists. They have 
become the property, though perhaps not 
exclusively, of the “Old Middle Class.” They 
belong to Riesman’s (20) “Inner-Directed”
group, so soon to be replaced by the Outer- 
Directed; to Mills’ (18) “cheerful robots,” 
so soon to become equally cheerful mass con- 
sumers. Yet it is these values that have 
generated what Mead (16) calls “the American 
core culture” and what Kluckhohn (10)
describes as the dominant value profile in 
American culture and expects to find focussed 
in the middle class, and which McArthur 
(13, 14) has empirically shown to be focussed 
in the lower middle. It still gives rise in the 
American middle class to a new generation 
instilled with the “achievement mores” whose
operation in a contemporary urban setting is 
described by McClelland (15). Children reared 
in this tradition are mobile, following what 
Miller and Form (17) describe as Ambitious 
Careers. As one study (14) shows, “The 
Future, Doing-oriented family must produce 
sons reared in the ‘achievement mores,’ 
taught to look forward to by-passing or sur- 
passing their fathers’ occupational roles.... 
It is the hopes of the mother that these sons 
must realize in order to feel successful. They. . . 
will have introjected her precepts. It is these 
boys who will, after college, be expected to 
leave the family and ‘make their own way.’ ” 
Children reared in this tradition are still (15) 
weaned and trained early, encouraged to be 
ambitious, hard-working, and clean-minded. 
“I know that Andy has a clean mind,” wrote
one father of a nonsmoker, “he does not smoke,
he does not loiter on the street with gangs, he 
has been brought up to look for the fine things 
in life and only by hard work and perseverance 
can they be achieved and enjoyed.” 


CHARLES McARTHUR, ELLEN WALDRON, AND JOHN DICKINSON


In the study, we have found (13) that a 
simple empirical tool for bringing out middle 
vs. upper class contrasts is to compare Harvard 
men who prepared for college in public high 
schools with Harvard men who prepared for 
college in private preparatory schools. The 
high-school graduate who goes to Harvard 
has usually received a heavy dose of “work 
morality” during his rearing. As an earlier 
study (14) suggested, “His role in the family 
often is that of standard-bearer, the son who 
is to win status or ethnic mobility.” The 
prep-school boy who goes to Harvard is more 
likely to have been bred on a code of acceptable 
behavior. Tradition sets bounds on him in 
terms of “what is done,” not in terms of 
“worldly asceticism” as a means of “getting 
ahead.” If a concentration of nonsmokers 
exists in our data, then we may well predict 
that it will focus among our public-school 
graduates. 

This is the case. In a steady progression 
with status, 20 per cent of the graduates of 
very exclusive private schools do not smoke, 
30 per cent of the graduates of less exclusive 
private schools, and 40 per cent of the gradu- 
ates from public schools. This pattern is 
significant at the .01 level. Part of the explana- 
tion of this fact may be that the prep-school 
boys learned to smoke in their dormitories 
(rules notwithstanding) while the high-school 
graduate did not experience dormitory life 
with its peer-orientation until coming to 
college. Some facts fit this notion. In retrospect, 
about three-quarters of the private-school 
graduates who smoke say that they learned to 
smoke in prep school or at the time of prep 
school; about three-quarters of the public- 
school graduate smokers say they learned to 
smoke in college. As one progresses from 
private boarding to private day to public day 
schools, there is a significant (.01) decline in 
the proportion of boys who already smoked 
at the time they were taken into our study. 
However, there is a status difference operating 
together with this dormitory factor. If we hold 
boarding constant, there remains a significant 
(.02) difference between the more exclusive 
and the less exclusive schools. Such a pattern 
may well have been generated in the families 
who elect to and are able to send their sons 
to these schools, rather than in the schools 
themselves. Besides, the tacit other half of 
the dormitory argument is that high schools 
do not orient their pupils toward smoking.
This cannot be said. Hollingshead (8) tells 
us that in “Elmtown” high school, “practically 
all boys (91 per cent) smoke, irrespective of 
age or (social) class.” Our public-school 
nonsmokers are therefore somehow special. 
It may be supposed that their rearing for 
mobility, in terms of the old ethic described 
earlier, may have created a considerable
“strong counteranxiety” to undermine the 
orientation from their peer culture toward 
learning to smoke. 


In Elmtown, 

Law and the mores deny high-school students the 
right to enjoy the pleasures derived from tobacco, 
gambling, and alcohol. However, the mystery with 
which adults surround these areas of behavior lends 
them a special value which seems to act as a stimulus 
to many young people who desire to experience the 
supposed thrill of pleasures their elders deny them. The
conspiracy of silence which is an essential part of the
clandestine violation of the mores has already taught
them how easy it is to avoid restrictions imposed by
law and taboo if they are discreet about how, where,
and under what circumstances it is done. Acquisition
of knowledge of the means of transgressing against
alcohol, tobacco, and gambling taboos without being
caught and the thrill of violating these taboos take
place for the most part in the clique.


Presumably any peer-oriented youngster is 
likely to smoke, whatever his social status, as 
part of a kind of defiant claim on adult status. 
It may be supposed that the more earnest 
students in high school take adult prohibitions 
more seriously and, perhaps, do not always fit 
well with “the clique.” Theirs may be a more 
compliant claim on adulthood. Perhaps among 
the earnest nonsmoking nine per cent of 
Elmtown seniors, there was one who applied 
to Harvard. More went to other colleges and 
were mobile into white collar technical and 
professional occupations. It is not to be 
supposed that attendance at any one university
is a potent cause. Census data (5) show a
nationwide increase of nonsmokers when
professional and technical people are com- 
pared with other occupational groups. Non- 
smoking in America is a social class phenom- 
enon. 

It is in England, too. A Hulton survey (7) 
suggested that nonsmoking was slightly com- 
moner among men of the English middle class 
while heavy smoking was commoner among
English working-class men. An excellent
market survey³ in England not only confirms
this difference in rate of smoking but also 
documents in many ways the fact that non- 
smoking is widely perceived as a mark of 
“middle-class respectability.” 

Our data certainly suggest that smoking is 
popularly defined as faintly disreputable or, 
at least, one of the small vices. Nonsmokers 
are significantly often (.01) nondrinkers; 
indeed, it seems that, as Straus and Bacon (22) 
report from Yale, drinking precedes smoking 
as the student takes up adult foibles. Our 
nonsmokers are also significantly often (.01) 
people who do not drink coffee. 

We may expect some orientation toward 
nonsmoking to have been taught these men 
by their religions. The kind of piety that Weber 
describes should spill over into self-denial, 
even in the absence of a specific theological 
proscription. Proscription of smoking some- 
times does occur. Among Protestant denomina- 
tions, it is usually those of less social standing 
(19)—e.g., Fundamentalists—who condemn 
smoking, while those of more fashionable status 
—e.g., Episcopalians—do not. The lower- 
middle-class focus of nonsmoking may be thus 
overdetermined. At any rate, we may find over- 
lap among smoking habits, social status, and 
piety. 

Earlier in the course of the study, Heath 
had set up ratings of the devoutness of the 
participants and also of their families. These 
ratings were based on considerable information 
about theological beliefs, church attendance 
and activities, personal use of prayer, etc. 
(No attention was paid to vices, great or small, 
and hence smoking in itself did not enter this 
rating.) The boys who did not smoke when 
they entered the study had more devout 
parents (.05). They were also more likely 
(.01) to attend church while at Harvard, 
despite the absence of any chapel requirement. 
Lifelong nonsmoking is associated even more 
strongly (.02) with devoutness of parents. It 
would seem that the individual lastingly 
introjects his smoking morality! Indeed, the 
nonsmokers are often rated higher on devout- 
ness than their parents. 

This introjected piety seems to be more an
individual matter than we expected. In the
study data, nonsmoking is the modal pattern
for the members of no one religious denomination.
Nor did the interaction between piety
and status work quite as expected. It is true
that, as expected, most Protestant families
with low incomes were rated high on devoutness
and produced sons who were nonsmokers.
Presumably, we see here the introjection of
pietistic standards, at least in the form of
“worldly asceticism,” in the absence of theological
proscription.

³ Davis, F. Cigarette Smoking Motivation Study.
This is an unpublished monograph prepared by Research
Services Limited, to whom we are grateful for
access to it.

The nonsmokers may represent survivals
not only of the Protestant ethic and middle-class
morality but also of the related phenomenon
Riesman (20) calls “Inner Direction.” For
example, in answering the Strong Vocational
Interest Blank, more sophomore nonsmokers
(.03) said they preferred “nights at home” to
“nights away from home,” and smokers said
the opposite. Also, nonsmokers preferred
(.02) “belonging to few societies,” smokers
“belonging to many societies.” These trends
were especially marked among nonsmokers
from public schools, whose earnest individualism
is thus reaffirmed, but private-school
nonsmokers showed some of the same trend.

A curious sidelight may be interpreted
variously: more sophomore smokers (.01) say
they would prefer “preparing the advertising
for the machine,” in the appropriate Strong
item, while nonsmokers prefer not to. It is
also true (.05) that more smokers responded
“Like” to the occupation of Advertiser, nonsmokers
responding “Dislike.” Perhaps the
nonsmoker is so Inner Directed that he does
not have his “radar” tuned to the latest fad
in mass consumption. He is, perhaps, not yet
part of the consumer ethic that is said (18) to
be appearing now among the middle class.

A related finding, based on Strong-like
items in a recent questionnaire, is that smokers
respond “Like” to the occupation Sales
Manager (.01) but “Dislike” to the occupation
Scientific Research Worker (.01). This attitude
is reflected in career choices: the nonsmokers
contribute more physical scientists (.01).
These facts overlap the status differences
already observed; few private-school boys go
into science but many go into business.
Whether the ethos of the physical scientist
overdetermines his nonsmoking over and above
the effect of his middle-class origins and
upward mobility is not clear from our data.
Certainly the two, mobility and science,
when combined pull powerfully: of the twelve 
public-school boys from low-income families 
who became scientists, ten never smoked 
and one tried it but stopped. 

In summary, the nonsmoker seems to have 
been oriented by the mores of a particular 
American subculture. He is often of lower- 
middle-class origin and himself upwardly mo- 
bile. He shows the “worldly asceticism” that 
has stemmed from the old Protestant ethic. 
Often he is pious, perhaps more so than his 
parents. It seems likely that he has reacted to 
smoking as being one of the “small vices” to
which the flesh is heir. He is, at any rate, an 
Inner-Directed person, introjecting the morals 
of his youth, perhaps a serious sort, and maybe 
an introvert. He does not go along with the 
suggestions for a consumer morality offered 
by the mass media. He approves scientific 
rather than business values and may often 
himself be a scientist or engineer. Just what 
causes underlie this web of correlations is not 
clear, but existing theory about the ethos of 
the Old Middle Class or about the American 
core culture seems relevant. The standards 
this man has introjected furnish sufficiently 
“strong counteranxiety” to prevent his sharing 
the orientation toward smoking that seems to 
be common to all the rest of American society 
both below and above him. 


WHO SMOKES HEAVILY?


Whether a man smokes or no seems best 
explained by his social orientation. Whether 
he becomes a heavy smoker seems best explained
by his personal needs. Bales (2) speaks
of the effectiveness, once a man has been
oriented toward adopting a habit, of “the
degree to which the culture operates to bring
about acute needs for adjustment, or inner
tensions, in its members. There are many of
these: culturally-produced anxiety, guilt, con- 
flict, suppressed aggression, and sexual ten- 
sions of various sorts may be taken as 
examples.” 

There is no doubt that our very few really
heavy smokers, who meet the criterion for
Heavy Smoking now customary in medical
research (“two packs a day for several
years”), have more than their share of “acute
needs for adjustment.” Most have had marital
problems, some quite dramatic. All are given
to impulsive acts, some to physical violence,
if only in the form of volunteering for dangerous
missions. Several are hard-driving, tough
competitors. None are usual for our group.
As one observer phrased it, “they are men who
live in overdrive!” Their stories are told by
Heath.⁴

That the need to smoke heavily might be 
generated by anxiety is, of course, a common-sense
hypothesis. Our data warn us to view it
with caution. We have not been able to corre- 
late changes in smoking with changes in the 
tenseness of a man’s life situation, in spite of 
the “clinical hunch” of staff members that
this relationship held. Perhaps anxious smoking
is episodic, so that our yearly questionnaire
is too coarse a measure of it. Perhaps
something is to be learned from the answers to 
a question about the symptoms of stress that 
our participants felt when under pressure. 
Half of the smokers said they “smoked more” 
in these circumstances. The interesting fact, 
though, is that 70 per cent of the heavy smok- 
ers but only 30 per cent of the light smokers 
said this. Is smoking, then, a suitable tension
reducer only after it has become a firmly
ingrained habit? One thinks of learning
theory: a response must be “high in the habit 
hierarchy” before it has the “availability” 
to reduce tension. Or one may think that this 
little item is a rather pretty confirmation of 
Bales’ contention that only after thorough 
orientation can a compulsive habit be used to 
satisfy anxious needs. 

The most striking correlation between heavy
smoking and personality is found in our ink- 
blot test material. In the early days of the 
study, Dr. Wells (24, 25) used an abbreviated 
Rorschach procedure, allowing one minute for 
response to each of the standard blots, which 
he called the Timed Rorschach. The resulting 
scores are not identical with Rorschach 
scores but seem to be interpretable by similar 
theory. The Rorschach variable we concen- 
trated on is the Experience Balance, defined as 
the ratio of the number of human movement 
responses to the weighted sum of the various 
kinds of color responses. If both members of
this ratio are equal to or less than 2 (a record
no richer than 2:2), one speaks of a “coartated”
Rorschach, by which one means an emotionally
narrow, drily factual, excessively controlled
performance. If either member of the ratio
exceeds 2, one sort or another of emotional
expression has taken place, and the label
“coartated” no longer applies.

⁴ Heath, C. Differences between Smokers and Non-Smokers,
in press.
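
The classification rule just described can be written out as a small sketch. The text says only that the color side of the Experience Balance is a "weighted sum"; the weights used below (0.5 for form-color, 1.0 for color-form, 1.5 for pure color responses) are the conventional Rorschach weights and are an assumption here, since the Timed Rorschach scoring may have differed. The function and parameter names are likewise hypothetical.

```python
def weighted_color_sum(fc, cf, c):
    """Weighted sum of color responses.

    Assumed conventional weights (not stated in the text):
    FC = 0.5, CF = 1.0, pure C = 1.5.
    """
    return 0.5 * fc + 1.0 * cf + 1.5 * c


def is_coartated(human_movement, color_sum, limit=2):
    """True when both members of the Experience Balance are <= limit.

    human_movement: number of human movement responses.
    color_sum: weighted sum of color responses.
    """
    return human_movement <= limit and color_sum <= limit


# Hypothetical records, for illustration only.
narrow = is_coartated(2, weighted_color_sum(fc=1, cf=1, c=0))      # 2 : 1.5
expressive = is_coartated(1, weighted_color_sum(fc=0, cf=0, c=2))  # 1 : 3.0
```

Under these assumptions the first record counts as coartated and the second does not, because its color member exceeds 2 even though its movement member is low.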

Heavy smokers produce more (.05) coartated 
records. (Our definition of Heavy Smoking is 
now loosened to include men with lifetime 
averages of a pack a day, so as to include a 
workable number of cases.) What is more, 
coartated people tend (.01) to increase the 
amount of their smoking strikingly as the 
years go by. This is not a universal trend; 
many of our men reach a smoking plateau
early or vacillate between nonsmoking and 
moderate smoking. The coartated person is 
common among that plurality of our people 
whose smoking curve “snowballs,” showing a 
positive acceleration. The appearance of these 
curves suggests to the eye that they be inter- 
preted as showing accelerating habituation. 
Interestingly, such curves often appear among 
people who started smoking late. (They are 
not just “catching up,” however; they soon 
surpass the smoking rates of men who started 
earlier.) 

This association between accelerating smok- 
ing rates and coartation is strong enough to 
override the other trends so far reported. A 
coartated man tends to become a heavy smoker 
no matter what his social background or 
how pious his family—or himself. His orienta- 
tion may show in his being a nonsmoker in 
college if he came from a nonsmoking type of 
background. Soon after college, he typically 
begins to smoke, and the habit accelerates. One 
is reminded of Bales’ (2) comment that, “there 
is reason to believe that if the inner tensions 
are sufficiently acute, certain individuals will 
become compulsively habituated in spite of 
opposed social attitudes.” 

Are our coartated people that badly off? 
There is no very detailed Rorschach theory 
(9, 21) from which one might interpret their 
tests. They do seem to indicate a certain lack 
of emotional resources—or unwillingness to 
use them. Understandably, these men were 
rated by the psychiatrists (26) as inarticulate, 
pragmatic, and bland. They were not necessar- 
ily thought to have inner tensions. They were, 
at any rate, quite hard to get to know. Many 
were rated “Just-So” by the psychiatrist; 
that is, they were compulsive fiddlers, desk-arrangers,
people who allayed their tensions
through fussy activities, of which smoking 
could understandably become one. This very 
striking finding about coartation remains hard 
to explain. Nor have we found the means to 
cross-validate it. Since the college Rorschach 
pattern predicts lifetime smoking, another 
long-term study would be needed for ideal 
cross-validation. 

Not all heavy smokers are coartated, of 
course. Among the noncoartated 
there is a tendency for the associations between 
amount of smoking and some of the psycho- 
social variables previously discussed to be 
particularly “clean.” It is as though coartation 
washes out the array of emotional variations 
between persons as well as, perhaps, the 
variability of the smoking rate. Among these 
noncoartated subjects, it does appear that 
the psychosocial variables have more to do 
with the difference between nonsmoking and 
moderate smoking (as discussed above) than 
with heavy smoking. 

In the noncoartated group, moderate smok- 
ing and heavy smoking are respectively 
related (.001) to the psychiatrist’s rating of 
Strong Basic Personality and Weak Basic 
Personality. One presumes that the latter 
rating is appropriate to the rather uncon- 
trolled men who were described as being among 
our very heaviest smokers. It is not true that 
everyone who smoked more than a pack a day 
over his adult life was poorly integrated. It is 
true that poorly integrated people were much 
more commonly found among these heavier 
smokers. Again, one sees a pattern such as 
Bales would predict: orientation leading to 
moderate smoking but “‘some need or complex 
of needs for adjustment” leading to excessive 
habituation 

In summary, then, we may hypothesize that 
starting to smoke is largely brought about by 
one’s social environment but that reactions to 
smoking, once it has started, seem to depend in 
good part on the personal needs that the newly- 
established habit is able to gratify. Some people 
seize on the habit compulsively. These people 
may often be emotionally constricted types for 
whom there is great gain in a simple “flight 
into behavior” or they may be restless, active 
men, for whom smoking is just one more 
impulsive activity. It would also seem that 
anxious people can seize on smoking as a 
tension reducer if they have already, for other 
reasons, been oriented toward it. In short, the 
habit, once well available, increases in strength 
if it serves well the person’s emotional economy. 


WHO CAN STOP SMOKING? 


As every habitual smoker knows to his 
sorrow, ability to quit or cut down decreases 
as smoking increases. If we divide our lighter 
from our heavier smokers at an adult lifetime 
average of half a pack a day, the contingency 
relation between “lighter” and “heavier” vs. 
“can stop” and “can’t stop,” as shown in 
recent responses to questionnaires, gives a p 
less than .001. Even more striking is the march 
(.01) of the mean numbers of cigarettes smoked 
per day during adult life. For men who can 
stop smoking this figure (computed only during 
their smoking years) is 9, for men who don’t 
try to stop it is 18, and for those who cannot 
stop it is 20. The sheer amount of tobacco so 
far consumed is by far the largest difference 
between the group of men who can stop and 
the group who, at least so far, cannot. 

The variables that were related to becoming 
a heavy smoker seem also to bear some relation 
to ability to stop. Thus, coartated Rorschachs 
were frequent among people who became heavy 
smokers but it is also true that, with heavy 
smoking held constant, the presence of a 
coartated Rorschach seems related to in- 
ability to cut down or stop. Some of our 
scientists have smoked; they could easily 
stop. (They were not heavy smokers, of 
course.) Perhaps the scientists show that it is 
easier to quit when one’s compeers do not 
smoke: none of our five writers can quit. 
Among our heavier smokers, the psychiatric 
labels of Sociable, Strong Basic Personality, 
and Practical characterize mostly people who 
can stop; Weak Basic Personality, Asocial, 
Lack of Purpose and Values, Introspective, 
Ideational, and Inhibited characterize mostly 
people who cannot. These are all small trends, 
however, and mostly occur in small numbers of 
cases. Yet these relationships seem worth 
mention because of their congruence with 
general theory. The variables related to smoking 
compulsively are mostly “need” variables, 
as the Bales theory would require. 

A signal fact is this: ability to stop smoking 
is directly proportional to the number of 
months our subjects were fed from their 
mothers’ breast! The means march (.05) as 
follows: light smokers who can stop were 
weaned at 8.0 months; heavy smokers who can 
stop were weaned at 6.8 months; smokers, 
mostly heavy, who don’t try to stop were 
weaned at 5.0 months; smokers, mostly heavy, 
who try to stop but cannot were weaned at 4.7 
months. We had previously explored for a 
possible relationship between smoking and 
weaning or amount smoked and weaning, but 
it was not until we explored ability to stop 
smoking that the relationship to breastfeeding 
became so cleancut. 

Many will wish to explain such a finding 
away. Certainly we would not argue that 
weaning is the cause of smoking. The argument 
must proceed through some tertium quid. 
In this we would agree with Linton (12), who 
stresses that such a crude datum as date of 
weaning or any other data “focussed primarily 
on actual technical operations, without a 
correspondingly detailed study of the maternal 
attitudes which accompanied these performances,” 
is naive. The events of infant 
training are part of a broader pattern and 
symptomatic of it. “We must remember that 
these influences operate on the child from 
birth, and continue to operate on him for a 
long period of time... .” The congruence of 
infant training, childhood rearing, and adult 
values is shown by McClelland (15) for two 
contemporary American subcultures. So, in 
our data, late weaning was associated with 
those personality traits that are also related to 
ability to stop smoking. We do not have to 
postulate that infantile frustration and adult 
cigarette smoking are unmediated cause and 
effect. 

Yet, it is a commonplace that people who 
stop smoking seek other oral gratifications. 
All the “small vices” we have correlated with 
smoking are oral vices: alcohol, coffee, and 
tobacco are taken in through the mouth. So is 
sugar, which Heath shows to be related to 
smoking. More directly to the point: Brozek 
(4) has experimentally documented the notion 
that men who give up smoking gain weight. 

Theoretical reasons suggest that smoking 
should be correlated with psychoanalytic “orality.” 
Our empirical data suggest that it is. 
Groups with the ability to stop smoking contain 
a smaller percentage of bottle babies. 
Thumbsucking was more commonly reported 
(close to .05) for men who continued to smoke. 
If one does not think of a psychoanalytic ex- 
planation, then a drive-reduction theory like 
that of Levy (11) seems to fit the data. Nor 
would these ideas be inconsistent with the 
Bales model: what they suggest is that these 
“deeper” needs have little or no effect on 
whether one smokes but great effect on how 
tenacious the habit, once adopted, may 
become. 

In summary, the ability to stop smoking is 
grossly related to the amount of tobacco one 
has consumed. Good mental health, such that 
one has control over one’s habits in general, 
seems to be relevant. So, apparently, is oral 
gratification received as an infant. This 
“orality” factor may be mediated in many 
ways, but its meaningfulness should not be 
overlooked. 


THE RELIABILITY OF THE FINDINGS 


To check the reliability of some of these 
findings, we drew five per cent random sam- 
ples of the class of 1958 and the class of 1961 
from the files of the University Health Services. 
(Ns were 55 and 58.) Every freshman fills in a 
medical questionnaire that contains an item 
about smoking as well as several items relevant 
to the variables related to smoking in the 
Study of Adult Development. Clearly, only 
the criterion of smoking in college (or near the 
start of college) was available. However, inso- 
far as these checks bore out our findings, they 
showed them to be reproducible over a period 
(1939 to 1957) of almost twenty years. 

Both samples showed an excess (.01) of 
public-school nonsmokers. In the first sample, 
the only one for which income data was avail- 
able, it was noted again that income was not 
as good a predictor of smoking behavior as was 
type of preparatory school. 

The relationship between seriousness of pur- 
pose and nonsmoking may be adumbrated by 
one datum. Within the public-school group, 
there is in both samples an excess of non- 
smokers among those boys who characterize as 
Definite their pre-college choice of career. 
Similarly, in the sample for which income data 
is available, there is a strong tendency for the 
boys who are both Protestant and poor not to 
smoke. These would seem to be likely to be the 
lads for whom Harvard meant hard work and 
mobility. Or, viewed another way, these would 
seem likely to include most of our members of 
Fundamentalist denominations. 

In both freshmen samples, those boys who 
announce their intention of going into en- 
gineering or science tend not to smoke. 


The examining physician rated these freshmen 
on “Personality Integration.” For both 
samples, the percentage of smokers marches 
upwards as the ratings become less favorable. 
For the combined samples, the percentages of 
smokers are: for men rated “A,” 0 per cent; 
for men rated “B,” 23 per cent; for men rated 
“C,” 37 per cent; and for men rated “D,” 50 
per cent. There were few A’s and few D’s. The 
contingency table made by combining A and 
B ratings in one column and combining C and 
D ratings in the other and setting these ratings 
against smoking and nonsmoking gives a chi 
square of 3.6, so p is not quite as low as .05. 

A similar rating by the examining physician 
was a prediction of College Adjustment. The 
percentages again march (though most clearly 
for the combined samples) as follows: among 
those rated “A,” there were 13 per cent 
smokers; among the “Bs,” 23 per cent; and 
among both “Cs” and “Ds,” there were 50 per 
cent. Combining A and B and C and D, one 
gets a contingency table from which chi 
square is 8.0 and p is less than .01. Both these 
findings from the physicians’ ratings would 
seem to parallel the study findings that smok- 
ing went with psychiatric ratings suggesting 
poorer mental health. 
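The two chi-square values reported above can be checked against the chi-square distribution with one degree of freedom, which is what a 2 × 2 contingency table yields. The sketch below is illustrative only and is not part of the original analysis; the function name `chi2_sf_1df` is hypothetical. It relies on the identity that, for one degree of freedom, the upper-tail probability equals erfc(sqrt(chi-square / 2)).

```python
import math


def chi2_sf_1df(chi2: float) -> float:
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom.

    For 1 df the statistic is the square of a standard normal deviate, so
    P(X >= chi2) = erfc(sqrt(chi2 / 2)).
    """
    if chi2 < 0:
        raise ValueError("chi-square statistic must be non-negative")
    return math.erfc(math.sqrt(chi2 / 2.0))


# Values reported in the text for the two physician ratings.
p_integration = chi2_sf_1df(3.6)  # Personality Integration
p_adjustment = chi2_sf_1df(8.0)   # College Adjustment

print(f"chi-square 3.6, 1 df: p = {p_integration:.3f}")
print(f"chi-square 8.0, 1 df: p = {p_adjustment:.4f}")
```

Under these assumptions the first value falls just above .05 and the second falls below .01, in line with the wording of the text.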

The study finding that smoking and drinking 
are correlated was amply confirmed for the 
class of ’58 but not so strongly shown in ’61. 
In both samples, the tendency for drinking to 
precede smoking was noted. 

On the whole, then, where we could cross- 
check the study findings, they seemed to hold 
up well. In this connection, it may be worth 
pointing out that a few of the patterns reported 
from the Harvard data have been found else- 
where. An unpublished survey of Yale seniors 
shows the public-private differences in smoking 
habits still pronounced at the end of the 
college career. The priority of drinking to 
smoking is noticeable. (This was, of course, 
first noted at Yale by Straus and Bacon.) The 
introjection of family standards shows very 
clearly as a source of smoking mores. We have 
already cited notable corroboration from 
market research done in England, where the 
association of nonsmoking with white-collar, 
middle-class identity was amply confirmed, as 
well as the definition of smoking as one of the 
minor vices. The same pattern is suggested by 
a national sample questioned about their 
smoking habits by the U. S. Bureau of Census 
(5). Results already cited from the University 
of Minnesota are consistent with our findings. 

The one area in which no cross-validation 
has been possible has been that of psychody- 
namics, especially the findings from the Timed 
Rorschach. Since these personality patterns are 
very relevant to a theory of smoking, it is to 
be hoped that something may be done to check 
them. 


SUMMARY 


A large part of what we have learned about 
the correlates of smoking habits may be made 
to fit a conceptual model like that of Bales. 
The fact that a man smokes or does not seems 
to be determined by whether or no he has 
been oriented to the habit as a result of his 
social milieu. Whether he becomes a heavy 
smoker or is unable to stop smoking seems 
determined by the usefulness of the smoking 
habit to his personal needs. 

Nonsmokers tend to be lower-middle class 
in origin, upwardly mobile, earnest young men, 
bred in a work morality that is conducive to 
Inner Direction. Their parents and they them- 
selves are often pious. They may pursue scien- 
tific or technical careers in many instances. 
Smokers are (in our data) likely to come from 
more privileged backgrounds, often entering 
business or humanistic careers, often having 
been raised in a Being or Being-in-Becoming 
orientation. Both subculture and the family as 
the mediator of the subculture are important 
determinants of whether and when the young 
man is oriented to the smoking habit. 

Whether a smoker becomes a heavy smoker 
seems to depend on whether the habit serves 
many of his important needs. Very anxious or 
agitated men may adopt smoking as a tension 
reducer, but this use of smoking seems to be 
common only when the habit is already well 
established by other circumstances. Emotionally 
constricted individuals seem to “take 
to” smoking with special eagerness. 

Whether a man can alter his smoking habits 
seems to be most “deeply” determined. 
Efforts to quit or cut down seem to be normal, 
since smoking is quite widely seen as a “small 
vice,” even by smokers. Whether a man can 
succeed in these efforts is first of all a function 
of how much tobacco he has consumed in his 
lifetime. Certain personality variables are also 
relevant. In general, it is the needs that lead to 
heavy smoking after the habit has begun that 
also militate against quitting or cutting down. 
The social variables that were related to starting 
to smoke play little role in inability to 
quit. The effects of oral gratification at the 
breast may be relevant to control excessive 
smoking, once a great deal of tobacco has 
been consumed. 

Where data were available, the study find- 
ings were cross-validated against samples 
from two Harvard classes, with generally en- 
couraging results. Some confirmation of other 
findings has been had from unpublished studies 
at distant points and from a nationwide study 
by the Census Bureau. 

Not all that we know about smoking fits 
into the proffered conceptual scheme. However, 
the scheme outlined makes our major findings 
hang together and may serve as a first hypoth- 
esis in further exploration of the psychology of 
smoking habits. 


ATTITUDE CHANGE AND JUSTIFICATION FOR COMPLIANCE¹ 


ARTHUR R. COHEN, JACK W. BREHM, AND WILLIAM H. FLEMING 


Yale University 


It has been demonstrated in a number of experiments 
(1, 3, 5, 6) that forcing a person to take 
an opinion position discrepant from what he 
privately believes results in change of private 
opinion toward the enforced one. The question 
then arises as to what variables affect the extent 
of private change under such conditions of compli- 
ance. This problem has recently been discussed by 
Festinger (2). According to his analysis, expressing 
an opinion discrepant from one’s privately held 
position creates “dissonance” or psychological 
tension having drive character. The tension may, in 
this case, be reduced by changing one’s private 
position so that it more nearly coincides with the 
position expressed. 

While the direction of the private attitude 
change may thus be understood, what, according 
to dissonance theory, controls the magnitude of 
change? Both the number and importance of 
reasons in favor of and against expressing the 
discrepant opinion control the magnitude of 
dissonance. Given the expression of the discrepant 
opinion and holding constant the number of 
reasons for not expressing it, the greater the 
number and/or importance of reasons for ex- 
pressing it, the less will be the dissonance. From 
these considerations we may state the major 
hypothesis: with the discrepant expression and 
reasons for not making it held constant, the greater is 
the justification for making it, the less will be the 
magnitude of dissonance and consequent attitude 
change. 

In addition, with the reasons for and against 
taking the discrepant position held constant, the 
stronger the expression of the discrepant opinion, 
the greater will be the dissonance created and 
therefore, the greater will be the resultant attitude 
change. 

An experiment by Kelman (4) has examined 
the effect on private attitudes of using specific 
nonsocial rewards to force compliance. The present 
study emphasizes another type of motivational 
induction, that of giving reasons unrelated to the 
attitude issue for expressing the discrepant opinion. 
In the present experiment, then, justification refers 
not to the attitude issue but to reasons for taking 
a view discrepant from one’s private opinion. 


1 This report was prepared as part of the Yale 
Communications Research Program. 


METHOD 


Subjects (Ss) were 92 Yale College undergraduates 
who were approached in their classrooms and given an 
anonymous questionnaire. The questionnaire asked 
for their opinions on a salient issue, instructed them to 
write an essay against their position, obtained their 
opinions on the issue again, and measured their per- 
ceptions of the experimental situation. 

The experiment was conducted shortly after a 
rumor that Yale would soon become coeducational had 
created considerable stir on the campus. Thus the 
issue of attitude toward coeducation at Yale was 
chosen as the salient attitude dimension. It was ex- 
pected that most students would show an initial bias 
against coeducation. 

The Ss were first asked to indicate their attitudes 
toward admitting women undergraduates to Yale. 
The scale used was a “pick-two” device; from six 
choice points a ten-point scale was generated. Ss were 
then asked to write an essay taking a stand opposite 
to the one initially indicated. This procedure was 
assumed to constitute the expression of an opinion 
position discrepant from the Ss’ private opinion. The 
instructions were the same for both experimental 
conditions except that the high justification group was 
given a number of reasons why the S should comply 
with the request to write the essay against his position. 

In the high justification condition, the Ss (N = 29) 
were told the following: E appreciated their help 
since the study represented considerable research 
effort on his part; it was a seminar research paper which 
E had to get done; E had received a great deal of 
cooperation, interest, and willingness to help from 
other students; the ability to stand back and consider 
the other side of a complex issue was a real asset; 
considering the other side of the issue was only fair; 
taking the other side facilitated a good grasp of a 
complex issue; and the results of such a survey could 
be of interest to social scientists and the university 
authorities. They were given these cumulated argu- 
ments and urged to do the best possible job. Ss in the 
low justification condition (N = 63) were simply given 
the instructions common to both experimental groups, 
which asked them to write the most productive, cre- 
ative, and imaginative essays possible, unequivocally 
against their own position. In order to minimize the 
pressure exerted to comply because of the very nature 
of the classroom situations, the Ss were assured that 
they need not comply at all if they did not want to. 
Such perceived pressure due to classroom constraints 
might be expected to reduce dissonance and prevent 
variations in attitude change from emerging. 

After writing their essays, the Ss were again asked 
to indicate their attitude toward the issue in the light 
of additional thought, and also to respond to a postexperimental 
questionnaire which attempted to measure 
the success of the experimental inductions. 


RESULTS 
Check on the Experimental Inductions 


Data from the postexperimental questionnaire 
indicated that the experimental inductions were 
effective in creating differences between the two 
groups in justification for compliance. The low 
justification group perceived that less pressure to 
comply was exerted upon them than did the 
highs (p < .02), they were more self-motivated to 
write the essay than the highs (p < .02), and 
they showed fewer interfering responses than the 
highs (p < .01).² 


Amount of Compliance and Attitude Change 


All Ss in both conditions complied. It may thus 
be inferred that the general force to comply was 
very compelling, and little attitude change is 
therefore to be expected. The average change in 
the high justification condition is .17 and that in 
the low is .46; both values are in the expected 
direction but neither is significantly different 
from no change. 


Amount of Justification and Attitude Change 


The major hypothesis concerns the relationship 
between dissonance created by compliance and 
attitude change toward the assumed position. It 
was expected that the condition of low justification 
would create more dissonance and consequently 
more attitude change than would that of high 
justification. The obtained difference between the 
two conditions in mean change, .29, is in the 
predicted direction but is not statistically sig- 
nificant. 

When the mean “before” opinions on the issue 
were examined for the high and low justification 
groups, the low group was found to be significantly 
more negative (M = 6.50 for the highs, 7.75 for 
the lows, p < .02). It was expected that negative 
attitudes toward coeducation would be shown 
more by upperclassmen who had internalized the 
Yale values than by lowerclassmen who had 
more recently come from secondary schools. In 
line with this expectation, the low justification 
group was found to contain far more upperclassmen 
than the high justification group, which 
contained relatively more lowerclassmen. The 
p value is beyond the .001 level. 


2 Except where specifically mentioned, all statistical 
tests in this report are two-tailed tests. 


The implication of these relationships between 
experimental condition, extremity of attitude, and 
class at Yale becomes evident when the data on 
extremity of position and attitude change are 
examined. The results show that extremity of 
position on the attitude scale is inversely related 
to change in attitude (p < .001). Since more of 
the extremes are in the low justification group, 
probably due to the unequal distribution of upper- 
classmen in the two groups, and since extremity of 
position inhibits attitude change, it is difficult to 
test the hypothesis put forth. Therefore, it seems 
reasonable to eliminate the extremes (all those Ss 
who began at 10 plus the few who began at 1) from 
the sample in order to correct for this bias; the 
hypothesis thus is tested within a selected population 
(N = 28 for the low justification condition, 
N = 17 for the high justification condition). After 
selection, the means on the “before” opinion are 
now virtually identical: 5.80 for the highs, 5.72 
for the lows. 
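The selection step described above amounts to a simple filter on the initial scale position. The following sketch is illustrative only; the record layout, the function names, and the scores are hypothetical and are not the study's data.

```python
from statistics import mean

# Hypothetical records: (condition, initial position on the ten-point scale).
# These values are invented for illustration and are NOT the study's data.
subjects = [
    ("high", 1), ("high", 5), ("high", 7), ("high", 10),
    ("low", 4), ("low", 6), ("low", 10), ("low", 10),
]

EXTREMES = {1, 10}  # scale end points excluded by the selection procedure


def select_nonextreme(records):
    """Keep only subjects whose initial position is not at either end point."""
    return [(cond, pos) for cond, pos in records if pos not in EXTREMES]


def mean_before(records, condition):
    """Mean initial position for one experimental condition."""
    return mean(pos for cond, pos in records if cond == condition)


selected = select_nonextreme(subjects)
for condition in ("high", "low"):
    print(condition,
          "before selection:", mean_before(subjects, condition),
          "after selection:", mean_before(selected, condition))
```

In this toy example, as in the report, dropping the end-point cases brings the two groups' initial means closer together, so that a later difference in change is less likely to reflect differences in starting position.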

One indication that the selection procedure is 
justified concerns the quality of the essays written. 
When the selected population is compared with 
the extremes on the number of original arguments 
given in their essays, 49 per cent of the selected 
population give one or more original arguments, 
while only 28 per cent of the extremes give such 
arguments (p < .05). The extremes thus appear 
to have been less involved in complying than the 
selected Ss; eliminating such differential involve- 
ment provides a better test of the hypothesis. 
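The comparison above can be roughly reconstructed as a 2 × 2 contingency table. The sketch below is an illustration under stated assumptions, not the authors' computation: it assumes that the 92 Ss divide into 45 selected Ss and 47 extremes, converts the reported percentages to whole counts by rounding, and applies an uncorrected Pearson chi-square. The source does not say which test was actually used, and the function name is hypothetical.

```python
import math


def pearson_chi2_2x2(a: int, b: int, c: int, d: int) -> float:
    """Uncorrected Pearson chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Assumed group sizes: 28 + 17 selected Ss, and the remaining 92 - 45 extremes.
n_selected, n_extreme = 45, 92 - 45

# Counts reconstructed from the reported percentages (an assumption).
orig_selected = round(0.49 * n_selected)  # Ss giving one or more original arguments
orig_extreme = round(0.28 * n_extreme)

chi2 = pearson_chi2_2x2(
    orig_selected, n_selected - orig_selected,
    orig_extreme, n_extreme - orig_extreme,
)
# For 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
p_value = math.erfc(math.sqrt(chi2 / 2.0))
print(f"chi-square = {chi2:.2f}, p = {p_value:.3f}")
```

Under these assumptions the statistic exceeds the .05 critical value for one degree of freedom, which is consistent with the reported p < .05; a continuity correction would give a smaller value.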

Comparison of the two selected groups on the 
effectiveness of the experimental inductions shows 
that they too differ on all three measures: the 
lows perceived less pressure, were more self- 
motivated, and had fewer interfering responses 
(all p values beyond the .04 level). 

When the major hypothesis is tested with the 
selected population, the lows show a mean change 
of .83, the highs a mean change of .22. The difference 
in amount of change is significant by one-tailed 
t test at the .07 level. A tendency is therefore 
suggested by the data for greater change to occur 
in the condition where greater dissonance was 
postulated. 
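Because the other tests in this report are two-tailed, it may help to see how the one-tailed level relates to a two-tailed one. The sketch below is illustrative arithmetic only: it uses the means reported in the text and the general fact that, for a symmetric statistic such as t with the difference in the predicted direction, the two-tailed probability is twice the one-tailed probability. The function name is hypothetical.

```python
def two_tailed_from_one_tailed(p_one_tailed: float) -> float:
    """Two-tailed p for a symmetric test statistic whose observed difference
    lies in the predicted direction (so p_one_tailed <= 0.5)."""
    if not 0.0 <= p_one_tailed <= 0.5:
        raise ValueError("expected a one-tailed p between 0 and 0.5")
    return 2.0 * p_one_tailed


# Mean attitude change reported for the selected population.
mean_change_low = 0.83
mean_change_high = 0.22

difference = mean_change_low - mean_change_high
print(f"difference in mean change = {difference:.2f}")
print(f"one-tailed p = .07 corresponds to two-tailed p = "
      f"{two_tailed_from_one_tailed(0.07):.2f}")
```

Read this way, the reported difference would not reach the conventional .05 level on a two-tailed test, which fits the authors' cautious description of it as a tendency.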


Strength of Induced Behavior 


Another assumed determinant of dissonance 
concerns the behavior which produces dissonance. 
It should hold that the better the quality of the 
essays written by the Ss against their original 
position, the more dissonance is created and, 
therefore, the greater should be the attitude 
change in line with the induced position. The 
essays were coded for number of original argu- 
ments, on the assumption that the improvisation 
of original arguments is a clear indication of the 
degree to which the S was actually involved in 
expressing the discrepant opinion. As such, the 
measure of original arguments represents the 
strength with which the discrepant opinion was 
expressed (3, 4). Among the selected Ss, a relationship 
between number of original arguments and 
attitude change was found only for the high justifi- 
cation group, the group where it was assumed 
little dissonance would be created. This relation- 
ship is significant at beyond the .001 level by 
chi-square test. For the low justification group 
where high dissonance and high attitude change 
occur, no additional dissonance is created by high 
quality of essay and, therefore, there are no 
variations in attitude change as a function of 
number of original arguments. In effect, where 
dissonance is not created by the experimental 
variables, it can occur as a function of individual 
variation in the strength of the induced behavior. 

It might be argued that those who write high 
quality essays do so because they feel greater 
justification. This hypothesis is counter to our 
present reasoning since in the high justification 
condition, quality of essay is related to attitude 
change. However, no reliable relationships were 
found between quality of essay and perceived 
pressure (experimental checks on justification 
manipulations). Furthermore, it is clear that this 
hypothesis will not explain the major finding, 
namely, that attitude change tended to be greater 
where manipulated (and perceived) justification 
was least. 

SUMMARY 

This experiment has tentatively illustrated 
some of the processes involved in creating and 
reducing psychological dissonance and the implica- 
tions of such dissonance for attitude change in 
compliance situations. It was hypothesized that 
increased justification for taking an opinion 
position discrepant from one’s private opinion 
would lead to decreased dissonance and therefore 
decreased attitude change toward the discrepant 
position. Ss were asked to write an essay taking 
a stand opposite to their initial opinion on a given 
issue. One group of Ss was then given a number of 
reasons for compliance with the request (high 
justification condition). For another group (low 
justification condition) Ss were given no detailed 
justification for writing the essay. Attitude change 
was determined from a postexperimental question- 
naire. 

Three sources of dissonance were explored: (a) 
for the whole population, low dissonance and thus 
little attitude change were obtained because of 
the forceful nature of the experimental situation 
which brought general compliance; (b) high 
dissonance and therefore high attitude change 
tended to be associated with the low justification 
condition; and (c) in the high justification condition, 
variations in the strength of the induced behavior 
brought variations in dissonance and thus 
subsequent variations in attitude change toward 
the assumed position. These results should be accepted 
with caution since they arise from post 
hoc analyses. 


AGGRESSION, ACQUIESCENCE, AND QUESTIONNAIRE 
CONSTRUCTION 


MAURICE L. FARBER 


University of Connecticut 


In a recent paper (3) Rabinowitz attempts to 
demonstrate that Farber’s (2) reported correlation 
between anality and political aggression can 
be explained away as a result of an acquiescence 
response set. There appear, however, to be serious 
defects in Rabinowitz’s methods that render his 
conclusions doubtful. Christie, Havel, and Seidenberg’s 
(1) recent cogent and detailed analysis of 
acquiescence is directly relevant; the present note 
will therefore be limited to brief comment on 
several features specific to Rabinowitz’s study. 

1. Rabinowitz’s critique is based upon the con- 
struction of new Anality and Political Aggression 
scales which are scored negatively for these char- 
acteristics. In this difficult task (vide Christie et al.) 
he does not seem to have succeeded. Two items of 
his five-item Anality scale (the second and fifth) 
and three of his four-item Political Aggression scale 
(first, second and fourth) appear highly doubtful. 
For example, is a “no” answer to “When I go to a 
restaurant I like to order foods which I have never 
tasted before” an anal reply? Or, is disagreement 
with the statement “The foreign policy of the 
United States should be based on the assumption 
that most other countries are our friends, not our 
enemies” an indication of political aggression? The 
rejection of a simplistic, soft-headed liberalism is 
not identical with active political hostility. Far- 
ber’s anality items hew very closely to the Freudian 
formulations, and his political aggression items in- 
volve affirmation of actively aggressive feelings. 

It is therefore not surprising that the correlation 
between Rabinowitz’s scales and Farber’s is but 
+.12 for anality and +.16 for political aggression. 
While no ultimate measure of validity is available 
for either set of scales, it is clear that the identically 
labeled scales are measuring something quite differ- 
ent. Since Rabinowitz’s method requires psycho- 
logically equivalent scales that are worded in 
opposite directions, his failure to achieve these 
renders his critique irrelevant. 

2. No item analysis is presented of Rabinowitz’s 
scales, nor, perhaps more important, is there any 
indication that the discriminatory power of his 
items was tested. In view of the lack of theoretical 
as well as their 


relevance of some of his items 


INDEPENDENCE AND 


DOUGLAS 


Pennsylvania 


i ee study was designed to test an hypothesis 
concerning consistencies in cognitive and 
personality organization. A second purpose was to 
provide further validation for previously observed 
relationships reflecting these hypothetical linkages. 
The general hypothesis may be stated as follows: 
individuals are consistent in their styles of response 
to field forces at different conceptual levels. Thus, 
Ss who show personality characteristics consistent 


1 This research was completed while the author was 
at The Menninger Foundation as a U. S. Public Health 
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possibly low discriminatory power, these omis- 
sions are serious ones. 

3. Rabinowitz chose to drop Farber’s modified 
Likert-type of scoring of the Political Aggression 
scale and to substitute simple agree-disagree 
categories. By so doing he precluded the possibility 
of isolating and properly weighting the genuinely 
aggressive response expressed in the “agree 
strongly” category, thereby dissipating much of 
the value of the scale. There is evidence, for ex- 
ample, that acquiescence responses are most fre- 
quent among those who agree slightly but rela- 
tively rare among those who agree strongly (1). 

When one considers these methodological de- 
ficiencies in conjunction with Christie’s demonstra- 
tion of the small role played by acquiescence in 
items similar to those employed here, the lack of 
validity of Rabinowitz’s strictures upon Farber’s 
reported relationship is apparent. 
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TO PERCEPTUAL FIELD FORCES' 


N. JACKSON 


State University 


with independence from group pressure should also 
be able to resist forces in the perceptual field. 
Jackson (4) has developed a reliable measure of 
the ability to resist reversals of perspective and has 
obtained a significant relationship between this 
measure and the tendency to disagree with F-scale 
items. Hypotheses provided by Witkin (7) and by 
Linton (6) regarding consistencies in perception 
and personality led the author to view this task as 
one in which S must cope actively with the per- 
ceptual field forces that cause figure and ground to 
reverse. Without necessarily invoking the hypo- 
thetical “brain fields” of the Gestaltist, it is 
possible to consider the inevitable and lawfully 
occurring reversals of perspective as manifestations 
of hypothetical forces in the visual field. Individual 
differences on this measure might be considered 
more generally as an indication of an S’s level of 
energy or activity in organizing perceptual and 
cognitive field forces, rather than submitting to 
them. In testing this hypothesis, Jackson (5) found
in a preliminary study that two criterion measures 
of this perceptual dimension correlated signifi- 
cantly and negatively with a measure of behavioral 
conformity and one of passivity on a group of 
males. The present study seeks to explore further 
the generality of this hypothetical dimension with 
a more economical group procedure. 


METHOD 


To order people conveniently with respect to per- 
sonality characteristics associated with independence 
of judgment, the personality items that empirically 
distinguished “independents” from “yielders” in con- 
formity experiments by Barron (1) and by Crutchfield 
(2) were combined into an “inventory.” Barron, using
Asch’s procedure, and Crutchfield, using a similar 
technique, placed S in a situation where experimentally 
created group pressure to conform was put in conflict 
with veridical perception and judgment. S had to 
choose between violating his own perception or pub- 
licly facing the prospect of being at variance with the 
judgment of the group. Behavior in these situations 
provides an index of an S’s susceptibility to external 
influence, of his capacity to withstand social field 
forces.

The inventory contained 45 true-false items. It was 
administered to groups of college students, 89 men and 
54 women in all. The age range was 17-40, with a 
median of 20 years. 

Two groups, “independents” and “yielders,” were
selected from the extremes of the distribution of
“conformity inventory” scores. The Ss were administered a
group procedure adapted from one developed by Hoch-
berg and McAlister (3) for determining the time one
phase of a reversible figure appears to be in perspective.
In this technique, a large India ink drawing of a Necker
cube (115¢ in. wide) was presented with explanation of
the nature of the reversals of perspective. After a brief
practice trial Ss were asked to attempt to hold in per-
spective the “up” phase (which was outlined) and to
indicate by writing a “D” or a “U” which phase was in
perspective just prior to the sounding of pure-tone,
half-second “beeps.” They were cautioned to keep their
eyes on the cube throughout the experimental series. To
assure an equivalent procedure for all groups, the
instructions and the pure tone signals were recorded on
tape and played through loudspeakers. There were two
trials of two minutes each. Each trial contained 24 of
the randomly spaced signals. The signals were ran-
domized to prevent them from being synchronous with
rhythmic reversals that occur in some Ss. The score
representing susceptibility to the visual field forces in
this situation was the total number of “D” responses in
the two two-minute trials.


RESULTS 


To test the hypothesis that “independents” 
would manifest significantly greater activity in 
coping with the visual field forces in the reversible 
figure, scores from the “conformity inventory” 
were tabulated and ranked. Because there was a 
small but significant difference between males and 
females on the inventory scores, the 27 per cent
extreme groups were selected separately for each 
sex, thus, in effect, partialling out the influence of 
sex in making the major comparison. There was a 
total of 36 “independents” and 36 “yielders” 
selected from the extremes of the distribution of 
143 Ss. These extreme groups were then compared 
with respect to the perceptual scores, the number 
of “down” responses in the two trials of two min- 
utes each. The differences are in the predicted 
direction; the “independents” showed a mean of 
11.4 responses indicating that the “down” phase
was in perspective (they were instructed to main-
tain the “up” phase) while the “yielders” had a
mean of 17.5 “down” responses. A single classifica-
tion analysis of variance between the two extreme
groups indicated that this difference was signif- 
icant (F = 17.56; df = 70; p < .001). Thus, evi- 
dence has been obtained for the hypothesis that 
“independents” may be more active in manipulat- 
ing perceptual field forces. 
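
As a check on the arithmetic, the reported figures can be related to one another. The sketch below assumes two equal groups of 36 and uses the standard identity that a single-classification analysis of variance on two groups gives an F equal to the square of Student's t. The pooled standard deviation it backs out is an inference from the reported means and F, not a value given in the text, and the function name is illustrative only.

```python
import math

# Values reported in the text
n_per_group = 36          # 36 "independents" and 36 "yielders"
mean_independents = 11.4  # mean number of "down" responses
mean_yielders = 17.5
f_ratio = 17.56           # reported F; within-groups df = 2 * 36 - 2 = 70


def two_group_summary(n, mean_a, mean_b, f):
    """Relate F, t, and the pooled SD for two equal-sized groups.

    With two groups of size n, F = t**2 and
    F = (n / 2) * (mean difference)**2 / pooled variance.
    """
    t = math.sqrt(f)
    diff = mean_b - mean_a
    pooled_var = (n / 2) * diff ** 2 / f
    return t, diff, math.sqrt(pooled_var)


t_value, difference, pooled_sd = two_group_summary(
    n_per_group, mean_independents, mean_yielders, f_ratio
)
print(round(t_value, 2), round(difference, 1), round(pooled_sd, 2))
```

Under these assumptions the reported F corresponds to a t of about 4.19 on 70 degrees of freedom and to a pooled within-group standard deviation of roughly 6.2 “down” responses.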

Some interesting sidelights were obtained con- 
cerning some properties of the particular measures 
used in the study. For example, the corrected split- 
half reliability of the group measure of perception 
(.84) was found to compare fairly well with that 
obtained under conditions of individual administra- 
tion, where it is usually above .90. The odd-even 
reliability of the conformity inventory, on the 
other hand, was quite low; after correction for 
length it was only .54 for the men and .44 for the 
women. Lack of homogeneity is often a serious 
problem in empirically derived indices such as the 
present one. It argues against the use of this scale 
for measuring attributes of particular individuals, 
although it may be of some use in discriminating 
broad groups for research purposes. On the inven-
tory, females were found to show a slight but sig-
nificant tendency (p < .01) to score in the “con-
forming” direction as compared with males. This
finding is consistent with the type of develop- 
mental theory discussed by Witkin and his asso- 
ciates (7) and by others. Significant sex differences 
have not been encountered in the present measure 
of reversals of perspective, however. 
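
The corrected reliabilities quoted above can be unpacked with a short sketch. It assumes that the “correction for length” is the usual Spearman-Brown step-up of a half-test correlation, which the text does not state explicitly; the half-test correlations printed are therefore implied values, not figures reported by the author.

```python
def spearman_brown(r_half, factor=2.0):
    """Step a reliability up (or down) for a test `factor` times as long."""
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


def implied_half_correlation(r_full):
    """Invert the doubling formula: half-test r that yields r_full."""
    return r_full / (2.0 - r_full)


# Corrected coefficients reported in the text
reported = {
    "perceptual measure": 0.84,
    "inventory, men": 0.54,
    "inventory, women": 0.44,
}

for label, r_full in reported.items():
    r_half = implied_half_correlation(r_full)
    # Round-trip check: stepping the half-test value up recovers r_full
    assert abs(spearman_brown(r_half) - r_full) < 1e-12
    print(f"{label}: corrected {r_full:.2f}, implied half-test r {r_half:.2f}")
```

On this reading the uncorrected half-test correlations would be about .72 for the perceptual measure and about .37 and .28 for the inventory, which makes the lack of homogeneity in the inventory plain.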

It is interesting to note that 31 of the 45 con-
formity items must be answered true to be scored
in the conforming direction. Acquiescent response-
set may therefore be an important aspect of this
measure. Such an interpretation is consistent
conceptually with the independence-conformity 
dimension. Jackson’s earlier finding (4) of a sig- 
nificant relationship between the perceptual 
measure and the F scale also may be consistent 
with this interpretation. 

The evidence suggests that those who charac- 
teristically resist certain hypothetical perceptual 
field forces also have the personality character-
istics associated with independence of judgment, a
form of resistance to social field forces. Those Ss 
who tended to answer personality items in the 
independent direction appeared to show greater 
activity in resisting reversals of perspective. 
EXTRAPUNITIVENESS AND ETHNIC ATTITUDE


GERALD S. LESSER 


Hunter College 


ACKERMAN and Jahoda explain the complete
absence of deep depression in their anti-
Semitic psychoanalytic patients by stating, “The
existence of an anti-Semitic reaction presupposes a 
tendency to blame the outside world rather than 
one’s self, and dynamically, such a tendency is 
in contradiction to the overtly self-destructive
trend of a genuine depressive” (1, p. 243). How- 
ever, other studies (e.g., 3) have reported incon- 
clusive results concerning the relationship be- 
tween extrapunitiveness and prejudice. The 
present study examines the relationship between 
anti-Semitism and extrapunitiveness among both 
Jewish and non-Jewish boys. 


METHOD 


The subjects were 20 white non-Jewish boys (ages 
10-0 to 13-1) and 24 white Jewish boys (ages 10-2 to 
13-2) drawn from one fifth-grade and two sixth-grade 
classes in two public schools in New Haven, Connecti-
cut. The schools are in adjacent districts and the fami-
lies constitute a relatively homogeneous upper lower-
class group.

Extrapunitiveness was measured through the Chil-
dren’s Form of the Rosenzweig Picture-Frustration
Study (5). The mean and standard deviation of the 
percentage of extrapunitive responses for the non- 
Jewish boys were 54.90 and 15.04 respectively, and for 
the Jewish boys were 55.24 and 13.62, respectively.
These means are not significantly different.

Modified sociometric procedures have been fre-
quently employed as a method for the indirect assess-
ment of social and ethnic attitudes in children (e.g.,
2, 4). Although variables other than prejudice also
affect friendship selection, these sociometric techniques
applied to the measurement of ethnic attitudes assume
that the degree to which a child accepts or rejects the
friendship of members of an ethnic group is related to
the degree of prejudice he directs toward that group.
In the present study, each child was asked to select his 
three best friends in his class and also to select the three 
boys that he wished were not in his class at all. An 
“anti-Semitism” score was obtained for each boy by 
subtracting the number of Jewish boys he chose for 
friends from the number of Jewish boys he rejected. A 
range of scores of +3 to −3 was obtained; the mean
and standard deviation for non-Jewish boys were 0.00 
and 1.42 respectively, and for the Jewish boys were 
−0.44 and 1.36, respectively. These means are not
significantly different. 
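
The scoring rule just described can be sketched as follows. The names and class roster are hypothetical and serve only to illustrate the rule (Jewish boys rejected minus Jewish boys chosen as friends); they are not data from the study.

```python
def anti_semitism_score(friends, rejected, jewish_boys):
    """Number of Jewish boys rejected minus number chosen as friends.

    Each boy names three friends and three rejected classmates,
    so the score can range from -3 to +3.
    """
    chosen_jewish = sum(1 for name in friends if name in jewish_boys)
    rejected_jewish = sum(1 for name in rejected if name in jewish_boys)
    return rejected_jewish - chosen_jewish


# Hypothetical classroom, for illustration only
jewish_boys = {"Aaron", "Ben", "Daniel", "Eli"}
friends = ["Aaron", "Carl", "Frank"]   # one Jewish boy chosen
rejected = ["Ben", "Daniel", "George"]  # two Jewish boys rejected

score = anti_semitism_score(friends, rejected, jewish_boys)
print(score)
```

Here the hypothetical boy chooses one Jewish classmate and rejects two, giving a score of +1.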


RESULTS 


Two Spearman rank-difference correlation 
coefficients (rho) were computed. For non-Jewish 
boys (N = 20), the correlation between extra-
punitiveness and “anti-Semitism” is +.60 (p
< .01). For Jewish boys (N = 24), the correspond-
ing correlation is +.48 (p < .01). These coeffi-
cients are not significantly different.
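
The statement that the two coefficients do not differ significantly can be illustrated with a short sketch. The text does not say which test was used; the code assumes Fisher's r-to-z transformation, which is only an approximation when applied to rank-difference coefficients, and the function name is illustrative.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """Approximate z test for the difference between two independent
    correlations, using Fisher's r-to-z transformation (an assumption
    here; it is exact only for product-moment r)."""
    z1, z2 = math.atanh(r1), math.atanh(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    # Two-tailed probability under the unit normal curve
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_two_tailed


# Coefficients and sample sizes reported in the text
z_stat, p_value = fisher_z_difference(0.60, 20, 0.48, 24)
print(round(z_stat, 2), p_value > 0.05)
```

Under this approximation the difference yields a z of about 0.52, well short of conventional significance levels, which agrees with the conclusion stated above.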


CONCLUSIONS 


Previous studies (e.g., 3) have not found clear 
evidence of a relationship between extrapunitive- 
ness and prejudice. The present study reports 
significant positive relationships between extra- 
punitiveness and anti-Semitism among both 
Jewish and non-Jewish boys. 

Sarnoff (6) obtained a strong negative rela- 
tionship between extrapunitiveness and anti- 
Semitism for a group of Jewish adults. For Jewish 
pre-adolescent boys, the present study found the 
reverse relationship, i.e., a significant positive 
relationship between extrapunitiveness and anti- 
Semitism. 





REFERENCES

1. ACKERMAN, N. W., & JAHODA, MARIE. The dy-
namic basis of anti-Semitic attitudes. Psychoanal.
Quart., 1948, 17, 240-260.

2. CRISWELL, JOAN H. A sociometric study of race
cleavage in the classroom. Arch. Psychol., 1939,
No. 235.

3. LINDZEY, G. An experimental examination of the
scapegoat theory of prejudice. J. abnorm. soc.
Psychol., 1950, 45, 296-309.

4. MUSSEN, P. H. The reliability and validity of the
Horowitz Faces Test. J. abnorm. soc. Psychol.,
1950, 45, 504-506.

5. ROSENZWEIG, S., FLEMING, EDITH E., & ROSENZ-
WEIG, LOUISE. The children’s form of the Ro-
senzweig Picture-Frustration Study. J. Psychol.,
1948, 26, 141-191.

6. SARNOFF, I. Identification with the aggressor: some
personality correlates of anti-Semitism among
Jews. J. Pers., 1951, 20, 199-218.


Received June 16, 1957


ANTICIPATED AUDIENCES AS DETERMINANTS OF RECALL 
SCHRAMM
University of Wisconsin


THIS is a study of the way in which reference
groups affect how human beings handle the
communications they receive. More precisely, it is
a study of a particular reference group (the
audience we expect to have to talk to concerning
a given topic) and how our image of this group
affects our handling of messages related to the
specific topic.
Fifty-five years ago Cooley (1, especially pp.
58-62) developed the concept of “imaginary con-
versations.” Much of our mental life, he said, is
lived as a rehearsal of imaginary and anticipated
conversations with persons who are important
to us, or rather, with our images of those persons.
“The mind,” he said, “is not a hermit’s cell, but 
a place of hospitality and intercourse. We have 
no higher life that is really apart from other 
people. It is by imagining them our personality 
is built up.” 

Many scholars have speculated as to just what
effect this kind of mind life must have on percep-
tion, learning, and attitudes, but little or no ex-
perimenting had been done on it until in 1951
Grace (2) published a paper based on current
developments in the theory of reference group
and social perception. He showed subjects a large 
array of objects which ranged from intimate 
feminine to intimate masculine apparel. The 
experimental subjects were told to remember all 
these articles they could because they would 
later have to recall and repeat what they re- 
membered. Some of them were told they would 
have to repeat them to a woman; others were not. 
All of them did report to a woman. Therefore 
the amount and nature of their recall could not be 
attributed to differential social constraint at 
naming intimate articles to a woman. There was 


no significant difference between the total number 
of items recalled by the two groups of subjects. 
However, those subjects who had been told they 
were to report to a woman tended to recall “fem-
inine” articles significantly earlier in their lists.
The experimenter’s supposition was that these 
female-related items had been made more salient 
by the knowledge that the anticipated audience 
was to be female. 

In 1956, Zimmerman and Bauer (3) reported 
an ingenious experiment which found clearer 
effects of an anticipated audience. Miss Zimmer- 
man appeared before several classes, representing 
herself variously as speaking for an organization 
interested in improving teachers’ salaries and as 
speaking for an organization concerned with
saving taxpayers’ money. She said her organiza-
tion was seeking speakers to address its members
on the subject of teachers’ pay, and was also in-
terested in how well people could remember
arguments on the subject. She read a short pas-
sage to the classes, in half the classes favoring
and in the other half opposing increases in
teachers’ salaries. She then asked the students to
reproduce this passage, in writing, as accurately
as possible. One week later she returned and asked
them to write the passage again, as closely as
they could remember it. Results were that at the
end of a week, subjects remembered significantly
more of the passage when they had been given a
selection that was congruent with the attitude of
the anticipated audience. That is, if a subject had
been given a passage that opposed teachers’ pay
raises and was told that he was to write a speech
for the Economy League, he remembered more of
the passage than another subject whose passage
favored pay raises and who was also to speak to
the Economy League. The experimenters also
found that subjects especially concerned with the 
audience (e.g., journalism students) showed more 
effect of this anticipated audience than subjects 
chiefly concerned with the topic (e.g., education 
students). 

The present experimenters were planning a 
study of this problem at the same time as Miss 
Zimmerman was collecting her data. When they 
found out about her work they modified their 
design somewhat in order to replicate the essen- 
tial feature of the Zimmerman-Bauer experiment 
and also to include an aspect of the relationship 
that was not included in the earlier work: the ef- 
fect of an anticipated audience on attitude. The 
latter aspect of this study is not reported here 
because the results (which did not confirm any 
simple relationship of attitude change to antici- 
pated audience) call for further research before 
they can be interpreted confidently.


METHOD 


This experiment, then, was based on the supposition
that a large percentage of the communications we re-
ceive during life are such as to call up an anticipated
audience, that is, some reference group or individual
to whom we think we may have to retransmit the mes-
sage, or with whom we may want to talk it over, or with
whom we may conduct an imaginary conversation
concerning it. Given that situation, it was hypothesized
that a person tends to remember the parts of a received
communication which he believes will appeal to the
interests or be consonant with the relevant attitudes of
the anticipated audience.
In order to test this hypothesis, approximately 200
Stanford speech students were divided into three
groups. A control group received attitude tests as be-
low, but no experimental communications. The other
two groups were given a two-sided set of arguments for
and against lowering the voting age to 18. These argu-
ments were equal in number for each side and were
matched by judges. These test groups were told that
they were to prepare speeches in competition for a
prize. One test group was told that the prize was to be
given by “The Society for Constitutional Government,
dedicated to preserving the U. S. Constitution un-
changed” and hence opposed to any change in voting
age; the other that the prize was to be given by “The
Association for Lowering the Voting Age to 18: If they
can fight, they can vote.” To reinforce this setting, the
name of the mythical organization for which the student
was expected to write and speak appeared on the page
with the arguments, although the same set of arguments
was used under both names.


TABLE 1
RECALL OF ARGUMENTS IN RELATION TO AN
ANTICIPATED AUDIENCE

                                      Number of Arguments
                              Recalled For   Recalled Against   Difference
                                 (Mean)           (Mean)

Audience For Lowering
Voting Age (N = 76)

Audience Against Lowering
Voting Age (N = 80)

CR for net difference (DF − DA) equals 3.98; p equals .0008,
2-tailed test.


An attitude test was administered first. Then the 
students were told about the prize contest, given the
arguments to read, and told to remember them because
they would be useful later when they would write the 
speech. One week later, the teacher administered a 
memory test on the arguments, and also repeated the 
attitude test. Later the students did write speeches on 
this subject, and small prizes were given for the best 
ones. 

The groups were randomly assigned and found to 
be closely equated on the obviously relevant meas-
ures: age, year in college, sex, and major subject.

RESULTS 


It was found that the subjects remembered 
significantly more arguments on the side they 
were supposed to address, as Table 1 shows. 
Thus it would appear that the recall of argu- 
ments from a two-sided group of arguments is 
significantly greater after one week for those 
arguments which agree with the supposed position 
of an audience with which the subject expects to 
have to discuss the topic. The hypothesis, and the 
Zimmerman-Bauer conclusion (3) must be re- 
garded as confirmed. 
ERRATA


In the article, “The Relationship Between Advanced Age and Mental Abilities,” by
Charles R. Strother, K. Warner Schaie, and Paul Horst, which appeared in the September,
1957 issue of this Journal (J. Abnorm. Soc. Psychology, 1957, 55, 166-170), the cuts for
Figs. 1 and 3 were erroneously transposed in printing. The graph labelled Fig. 1 is derived
from the data shown in Table 4 and represents the corrected T-score values. The graph labelled
Fig. 3 is derived from the data in Table 1 and shows the uncorrected values.


In the article by E. H. Schein, W. F. Hill, H. L. Williams, and A. Lubin, “Distinguishing 
Characteristics of Collaborators and Resisters Among American Prisoners of War,” this Jour- 
nal, 1957, 55, 197-201, the final sentence of the first main paragraph on p. 199 should read, 
“No comparisons between resisters and collaborators are shown, as none of these were significant.” 